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Gravitational lensing in clusters of galaxies is an efficient tool to probe the mass distribu- 
tion of galaxies and clusters, high redshift objects thanks to the gravitational amplification, 
and the geometry of the universe. We review some important aspects of cluster lensing and 
related issues in observational cosmology. 



§1. Introduction: Discovery of giant luminous arcs 



More than a dozen years ago, the 
discovery of extremely elongated and lu- 
minous arc-like images in three distant 
rich clusters of galaxies A3 70, A2218 
and CL2244-02 was reported by Lynds 
and Petrosian 1 ) in the Bulletin of the 
American Astronomical Society. Inde- 
pendently, Soucail et al. 2 ** (see Fig. 1) 
discovered many arcs in the cluster A370 
(z = 0.37). Soon after, Soucail et al. 3 ) 
measured a redshift of z = 0.724 for the 
giant arc in A370 — nearly twice the 
redshift of the lensing cluster — thus 
confirming the gravitational nature of 
the arc, as suggested by Paczyhski. 4 ) 
Giant luminous arcs (GLA) are typically 
bluer, longer and generally brighter than 
(normal) cluster galaxies. 

These discoveries have promoted 
gravitational lensing to the position of 
a major cosmological tool for the fol- 
lowing purposes: i) study the cluster 
mass distribution; ii) probe high redshift 
galaxies using clusters as natural gravi- 



Fig. 1. The discovery of the Giant Luminous 
Arc. An arc-like structure (AO) appeared 
on the CCD image of central part of cluster 
A370 at z = 0.374. 2) The image was taken 
during autumn, 1985, with a 320 x 512 pix- 
els CCD using the Canada-France-Hawaii- 
Telescope (CFHT). The pixel size was ~ 
0.8 arcsec. The strange blotch at position 
~ (—20, 30) is a CCD defect in the original 
CCD. 
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tational telescopes; iii) constrain the geometry of the universe, as parameterized by 
f2o and Aq, and especially the value of the cosmological constant; and iv) search for 
large-scale structures in the high redshift universe. This emerging new area seems 
to represent the realization of Zwicky's 5 )' 6 ) dreams. 

In this paper, we present the status of gravitational lensing studies of clusters of 
galaxies in regard to the above four topics — except the status of the weak lensing 
mass reconstruction, which is reviewed in another paper in this volume. 7 ** 

This review is organized as follows. Section 2 gives the fundamentals of grav- 
itational lensing in order to understand how the cluster mass distribution can be 
constrained in the strong lensing regime. This description is restricted to key equa- 
tions, since several very good reviews on lensing fundamentals have been published. 
The reader who is not familiar with the field of gravitational lensing can consult the 
reviews by Blandford and Kochanek, 8 ) Blandford and Narayan, 9 ) Schneider, Ehlers 
and Falco, 10 ) Fort and Mellier, n ) Wu 12 ) and Narayan and Bertelmann 13 ) and Mel- 
lier. 14 ) Section 3 presents an overview of the modelling of the mass distribution in 
clusters in the strong lensing regime, with a comparison between the lensing and 
X-ray mass estimates. Indeed, a discrepancy of a factor of 2-3 is generally observed 
between strong lensing and de-projection of the X-ray emission of the intra-cluster 
medium (henceforth ICM). Proposed solutions to this problem are summarized. Sec- 
tion 4 discusses the "dark" lens problem in connection to the search for very high 
redshift clusters. Section 5 presents recent applications of the use of clusters of galax- 
ies as natural gravitational telescopes to study the nature of high redshift galaxies. 
Section 6 summarizes the studies connecting the statistics of lensed galaxies in clus- 
ters, nicknamed "arc statistics". Arc statistics are sensitive to: i) the cosmological 
world models (and particularly to i?o an d Aq); ii) the average cluster mass distri- 
bution/profile; and iii) the nature and evolution of high redshift galaxies. Clearly, 
arc statistics depend on a large number of parameters. We therefore attempt here 
to clarify its possible application in order to sharpen focus of considerations on its 
future use. Section 7 summarizes the proposed ideas to constrain the geometry of 
the universe by using the lensing effect due to clusters of galaxies. In each section, 
we attempt to explain how the new generation of telescopes and instruments (SUB- 
ARU, VLT, Gemini, Astro-E, AXAF, XMM, NGST, LSA/MMA, LMSA ...) may 
enrich the cluster-lensing subject. 

This review focuses on various aspects (particularly those we have been deeply 
involved with) and should not be considered exhaustive. However, we believe that 
this focus makes this review unique from others and that it will help readers to catch 
up quickly with the current frontier of this subject. 

§2. Fundamentals of cluster lensing 

Gravitational lensing theory is based on the limit of a weak stationary gravita- 
tional field. Furthermore, in the case of cluster lensing, the size of the deflector is 
much smaller than the light propagation distance. Therefore, the lens can be safely 
approximated by a single plane of matter with a projected surface mass density S{9) 
- usually referred as the thin lens approximation. All our discussion is based on 
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these assumptions. 

2.1. Definition of an arc 

General relativity states that the presence of massive object distorts space-time. 
Thus propagating light beams coming from distant sources will be sensible to any 
mass inhomogeneities 'en route'. The light coming from any extended sources and 
passing through a massive cluster of galaxies will therefore suffer major distortions 
and amplifications due to the gravitational lensing effect. In some extreme cases, 
the distortions are non- linear (on the scale of the image) , and the image is strongly 
curved, having no doubt as to the gravitational nature of the image. Such lensed 
images are usually called "arc", yet it is clearly a subjective definition. In this paper, 
to quantify the definition of a lensed image, we use the term arc for images with an 
axis ratio (length-to-width) larger than 10, and arclet for images with length-to- 
width ratios between 2 and 10. Furthermore, giant arcs brighter than V = 22.5 are 
defined as giant luminous arc (GLA), following the definition of Le Fevre et al. 

2.2. Lensing equation: Mapping and deformation 

The lensing equation can simply be expressed as a mapping from the image plane 
to the source plane. For an angular position of the image 9, the angular position of 
the source (3 is given by the lensing equation 

/3 = 0-VW>(0), (2-1) 

where ip{6) is the effective lensing potential and Ve is the 2D gradient by 6. It is 
important to realize that this equation may admit multiple solutions in for a fixed 
/?, depending on the values of the potential ip. If multiple solution exists, we will say 
that the lens produces multiple images of a source at position (3; this domain defines 
the strong lensing regime (see below). 

The effective lensing potential ip is defined by the Poisson equation, 

VeV = = 2«(0), (2-2) 

where E cr is the critical surface mass density defined by 

c 2 D 

Z cr = _£ (2.3) 

The surface mass density normalized by the critical surface mass density is called 
the 'convergence' k(9). The effective lensing potential can be obtained by solving 
the Poisson equation (2-2): 



$0) = - f K (0*)ln|0 - 9'\d 2 9'. (2-4) 
vr J 



A surface element of the image plane 59 is related to a surface element of the 
source plane 5(3 via the inverse of the magnification matrix A, defined by 
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where VgVg is 2 x 2 diadictensor and J is 2 x 2 unit matrix. The magnification 
matrix can be written as a function of the second derivatives of the potential. In 
Cartesian coordinates, it reads 

(-, _ <P± d 2 tp \ 

d 2 j, i _ d4 > ( 2 ' 6 ) 
d6 X d6y 1 W$ ) 

and in polar coordinates, 

i _ d 2 ip _d_ (ld±\ 

4 -i _ [ ~58* de \edif) 1 / 9 . 7 n 

71 — I a / 1 dib\ -, i d± _ i d 2 i> I ' v z v 



\f dip J 



i _ 1 

1 6» ae W W 2 




where 9 = \9\, and 99 is the azimuthal angle of the image position. Note surface 
elements of image and source planes should be taken as 59 = (59,95p) and 5(3 = 
(5f3,P5ip') in the polar coordinates, where (3 = \0\ and ip' is the azimuthal angle of 
the source position. 

In the limit of a single lens plane, the magnification matrix is real and symmetric. 
Thus in this limit it is diagonalizable. The transformation of a surface element is 
non-linear and can be expressed by two effects, an isotropic magnification expressed 
by the convergence k and a deformation expressed by the complex shear 7: 

1 --+•»)■ (2 ' 8) 

where 7 = 71 + 272 = 7(cos(2#a) + isin(26' J 4)) with 9a giving the shear direction. 

In the shear coordinates, the magnification matrix is diagonal with proper values 
1 — k — 7 an d 1 — K+7. The total magnification fi is defined by the determinant of the 
magnification matrix (when it is defined): fi = detA = 1/((1 — k) 2 — j 2 ) = H{9)~ 1 , 
where H(9) is the Hessian of the lens mapping. Depending on the sign of the 
total magnification fi, the parity of the image will change. For fi > 0, the image 
has the same parity as the original source, even, while for \i < 0, the image has 
mirror-symmetry compared to the original source, and the image parity is odd. The 
family of points for which the inverse of the magnification matrix is singular (infinite 
magnification: fi^ 1 = 0) are called critical lines. They correspond to k + 7 = 1 
(external) or k — 7 = 1 (internal). The corresponding curves in the source plane are 
called caustic lines. 

For a non-uniform mass distribution, if S(6) exceeds U cr in a region of the image 
plane, it ensures that the equation fi^ 1 = admits a solution. Thus critical lines 
exist. Clusters with a surface mass density higher than the critical surface mass 
density are called critical lenses, and such a surface mass density is said to be super 
critical. 

Arcs occur close to and across critical lines (as the merging of two or more 
"multiple images"). They are strongly deformed and highly magnified. The region 
in which arcs can be found is referred to as the strong lensing regime. 
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2.2.1. Case of a circular symmetric lens 

Although the circular symmetric case is an unrealistic description of a cluster 
mass distribution, it is helpful to understand the gravitational lensing. We shall 
discuss it in more detail here. 

Indeed, in the case of a circular symmetric lens, the lens equation reads 

and the inverse of the magnification matrix is (in polar coordinates) 

W* ° \ (240) 

\o o i-^fflj' (210) 

where M(0) is the lens mass enclosed within a radius Dd#, 

M{6) = 2ttDI / E(9')9'd9'. (2-11) 
•/ o 

The factor ^ in (2-10) is the tangential deformation factor of the image. This 
can easily be understood by geometrical considerations. In the case of a circular 
symmetric lens, a light ray can only be radially displaced. Now, consider a (small) 
source at position (3 from the centre of the mass distribution with a length I in 
the tangential direction. Its image is located at position 9 with a tangential length 
V = I x as only radial displacements are allowed. Similarly, (^§) _1 is the 
deformation factor of the image in the radial direction, because this is the ratio of 
the unlensed source width, 5(5, to the lensed image width, 59. 

When the surface mass density follows a power law of the form E(9) = Cq9~ 5 
with 5 < 2, the magnification matrix can be written as 

/ 1 _ 2Co fi-S\ l n \ 

^= 1 • (2-12) 

In the circular case, critical lines are easily defined, by symmetry considerations; 
they are circles. The tangential (external) critical curve given by (5/9 = is a circle, 
called an 'Einstein ring'. The Einstein radius is defined by 

1 /M(6» E )\ 1/2 



The radial deformation factor at the Einstein radius is 



fd(3\ 


-i 




-i 


\d6j 









(2-14) 

When the surface mass density follows a power law of the form U(9) oc 9~ 5 , the 
radial deformation factor can be written as 
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Therefore, for a lens with 5 = 1, which corresponds to the SIS model, the radial 
deformation factor is unity, while a more compact mass distribution (<5 > 1) forms 
narrower arcs (the radial stretching rate is less than 1), and a shallower mass dis- 
tribution (6 < 1) produces thicker arcs (the radial stretching rate is larger than 

!)• 

The radial (internal) critical curve given by d/3/dO = is either a point (if the 
mass distribution is singular) or a circle of radius 

i 

i (M(9 R )y ( i x 7 



"*=^Urj (i^rn 1 ■ (216) 

When E{0) oc (T" 5 , the convergence and the tangential deformation factor at the 
radial critical radius are given by 



2 VI 



(2-18) 



P 

Equation (2-17) shows that S < 1 is required for a lens to have a radial critical 
line, since the convergence must be positive. Equation (2T8) shows that for a lens 
with 5 = 0.5, the tangential deformation factor is unity, while a more compact mass 
distribution (1 > 5 > 0.5) forms thinner arcs and a shallower mass distribution. The 
case 5 < 0.5 produces thicker arcs. 

If the source position is slightly off of centre, the ring is split into two tangential 
highly stretched arc lying close to the Einstein ring. The typical angular separation 
of images is of order 2#e- Therefore, if a pair of arcs or multiply imaged arcs are 
found in the cluster core, the Einstein radius can be estimated from the distance 
between the arc and cluster centre, and a very crude estimation of the mass of the 
cluster within the Einstein radius can be estimated by 

M(0e) = 7T0|i; cr . (2-19) 

The critical surface density U cr is obtained by measuring the redshifts of the lens 
cluster, z<i, and source, z s , by fixing the values of J?o, and Hq. For example, for 
z s = l and z d = 0.3 with (^ ,^o,^o)=(l-0,0.0,100/ikm/sec/Mpc), we have 

U CT = 1.09/i _1 g/cm 2 , (2-20) 

which is of the order of the weight of a sheet of paper. 

Before the discovery of giant arcs, the Coma cluster was the archetype of a 
rich cluster of galaxies. The central surface mass density of the Coma estimated 
from its X-ray emission, assuming spherical symmetry and isothermal hydrostatic 
equilibrium, is given by 

2 — <°> - h 3 ^ - °- 37 (sssv) (m) (200W g cm "< 

(2-21) 
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where the electron density is assumed to be described by n e (r) = n e (0)(l + (^-) 2 )~'2^ . 
Since this value is smaller than the critical surface mass density, it was believed that 
clusters of galaxies could not produce strong lensing events. Clearly, the discovery 
of giant arcs has revolutionized the view we had of the mass distribution of cluster 
cores. 

2.2.2. Spherical lens models 

The analytical formulae of frequently used spherically symmetric mass distribu- 
tion models are summarized in this subsection. 



• Singular isothermal sphere (SIS) model 

A singular isothermal sphere (SIS) is a solution of the collisionless Boltzmann 
equation. The word "isothermal" means here that the velocity dispersion of the 
system is isotropic and uniform. SIS is frequently used in lensing analysis, since the 
density profile of SIS is very simple, and most quantities related to gravitational 
lensing are described in simple analytic forms. It is instructive to examine lensing 
properties using SIS (see, for example, Turner, Ostriker and Gott III 16 )). 

The mass density of SIS is described by 



P(r) = Po 



2 



a 2 



2nGr 2 ' 



(2-22) 



where ro is the scaling parameter, po is the mass density at ro, and a 2 is the line of 
sight velocity dispersion. 

Integrating the above equation along the line of sight, one obtains the surface 
density of the singular isothermal sphere at an angle 9, 



a 2 



The lens mass enclosed within a radius is found to be 

M{6) = ™?M. (2-24) 
G 

Although the mass within a finite radius does not diverge, the central density and 
the total mass diverge. Therefore this model is not a physical representative of a re- 
alistic mass distribution and should only be considered as a simple model to quickly 
estimate physical parameters. 

•Isothermal sphere with a finite core radius 

To avoid the divergence found in SIS, a finite core can be added. The density 
profile of an isothermal sphere with a finite core radius is therefore defined by 




p(r) = p 1 + , (2-25) 
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where po is the density at the centre, and r c is the core radius. 

The surface density of an isothermal sphere with a finite core radius is 

m = ^=24, (2-26) 

where 6 C = r c jD&. The mass enclosed within the radius 9 is 

M{6) = 2TT 2 D 3 dPo 9 2 c + 02 _ e)j . (2-27) 

Similarly as the SIS, lensing quantities of an isothermal sphere with a finite core 
radius are also described by simple analytic functions. However, the total mass of 
this model is still infinite. 

•Modified Hubble law 

The modified Hubble law is used as an analytical approximation of the King 
model, which is one of the solutions of the collisionless Boltzmann equation. The 
King model can be approximated at small radius by the modified Hubble law. 

The mass density profile of the modified Hubble law is given by 

/ 2\- 3 / 2 

p(r) = Po { 1 + , (2-28) 

where po is the density at the centre, and r c is the core radius. Integrating the 
density distribution along the line of sight, one can obtain the surface density 

= • < 2 ' 29 > 

The projected mass of the modified Hubble law is 

M{6) = 2nDle 3 cPo log (^-^j ■ (2-30) 

Since the mass density vanishes in the limit r — > 00 faster than SIS, it is difficult 
to produce strong lensing events for a finite mass unless it is extremely compact and 
massive. The modified Hubble model was conventionally used in describing a galaxy 
distribution in a cluster. 

•Isothermal [3-model 

It is empirically known that (generally) the radial profile of the X-ray surface 
brightness can be described well by the so-called /3-model among X-ray astronomers, 
that is, 

-3/3+1/2 

(2-31) 



S x (r) = So 



1 + 



yi2 



If the intra-cluster gas is isothermal (the gas temperature is constant throughout the 
cluster) , and in hydrostatic equilibrium, the electron density profile is obtained from 
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this formula as n P (r) = n e (0)(l + (r/r c ) 2 y^/ 2 . Then we can solve the hydrostatic 
equation to obtain the cluster mass distribution. The mass density distribution of 
the isothermal /3-model is given by 



3+( 



r r 



\2 



P{r) = P ° [l + (r/r c ) 2 r (2 ' 32) 

where r c is the core radius. The parameter p$ is determined from X-ray observations 
by fitting the /3-model to the X-ray surface brightness. It is given by 

3pk B T 

P0 = toGpmnrV (2 ' 33) 

where (5 is the slope parameter in the /3-model, fcs is the Boltzmann constant, T is 
the temperature of the X-ray emitting gas, \i is the mean molecular weight, and mn 
is the Hydrogen mass. The surface density is 

m = -PoD ^f e + 2 p 2 . (2-34) 

The projected mass of the isothermal /3-model is 

M{9) = 2<ir 2 D 3 d p e 2 c ^ +e2 . (2-35) 

The temperature required to have an Einstein ring radius of 9e is written as 
* B X=^ w *|L^f7e (2 . 36 ) 

This formula is useful to check whether X-ray results are consistent with strong lens- 
ing mass estimates, assuming that the mass distribution is spherical. 

•NFW profile 

The NFW universal density profile is a fit of a relaxed system simulated by 
A^-body simulations with a high resolution. 17 ) According to Navarro, Frenk and 
White, 17 ) all profiles found in the simulations have the same shape, independent, of 
the halo mass, initial density fluctuation spectrum, and values of the cosmological 
parameters. (For these reasons it was called 'universal'.) 

The NFW density profile is given by 

P(r) = ^ x2 , (2-37) 
f fl + f 



where p cr is the critical density, S c is the density parameter, and r s is the scaling 
parameter. 

One obtains the surface density by integrating the mass density along the line 
of sight. The surface mass density is, for 9 < 9 S (= r s /D^), 



17(0) = 2p D d 9 s 



26i , Os-0 9j 

-arctanlV 



(02 _ 02)3/2 \e s + e 9 2 -9 2 



(2-38) 
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for 9 = 9 S , 

and for 9 > 9 S , 

U(9) = 2 Po D d 9 s 



U(9) = -p 9D d , 



291 



(0 2 - #2)3/2 



arctan 



+ 



+ 9 S 9 2 -9 2 



(2-39) 



(2-40) 



where p = 5 c p CT . For 9/9 s < 10~ 3 , E oc 9~ 01 . For 10~ 3 < 9/9 s < HT 1 , E oc 9-° A . 
For 10" 1 < 9/9 s < 0.3, E oc /T a6 . For 0.3 < 0/0, < 1, £ <x 0~ a95 . For 1 < 9/9 s < 
10, E oc 6>- L5 . For 10 < 0/0, , 17 oc 6>~ 2 . 
The projected mass for < S is 



3^3 

s 



M(0) = 47rp O -Dd : 
for 9 = 9 S , 



29 s 



l0g 2 s + v^J^F 



=arctanh 




M(0) = 47r POJ D^ s 3 (log^- + l), 



and for 9 > 9 S , 



M{9) = AirpD^O 



20, 



29 s ' Ve^Ws 



= arctan 



(2-41) 
(2-42) 

(2-43) 



The formulae for the NFW density profile are found in Bartelmann 18 ) and Maoz et 
al. 19 ) 

2.2.3. Pseudo-elliptical lensing potential 

If one would like to construct a physically motivated cluster mass model, one 
should start from the 3D mass distribution. However, as long as the mass distri- 
bution of the lens is thin enough to be described sufficiently well by the thin lens 
approximation, only the surface mass density affects the lensing phenomena. Fur- 
thermore, it is usually complicated to construct the projected mass distribution and 
its lensing properties' model based on a realistic 3D mass distribution even for an 
ellipsoidal mass distribution (see the next subsection). Therefore, the elliptical sur- 
face mass density distribution and the elliptical lensing potential have been widely 
used in lens modelling because they provide simple analytic descriptions of physically 
motivated mass models. In this subsection, the nature of the non-singular pseudo- 
elliptical lensing potential Eq. (2-44) is summarized, since it has been widely used 
for lens modelling, and the behavior of the arcs produced by the pseudo-elliptical 
lensing potential illustrates typical elliptical lensing configurations. 

The non-singular pseudo-elliptical lensing potential is defined by 

m = ^4tt4[#o 2 + (1 - + (1 + e)0 2 2 ] 1/2 , (2-44) 
u s c 

where e = (a 2 — b 2 )/(a 2 + b 2 ) is the ellipticity of the potential. The ellipticity of the 
projected mass density is ~ 3e for small e. 20 ) When e = and 9q = 0, a v is exactly 
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the line-of-sight velocity dispersion (ID velocity dispersion) of a singular isothermal 
sphere, and it is a constant throughout the cluster. 

However, in the case of a finite core radius, interpretation of a v requires caution. 
In the circular potential limit (e = 0), a v can be related to the ID central velocity 
dispersion of the original 3D mass distribution, 20 ) o"o, by assuming an isotropic 
velocity distribution, self-gravitating spherical equilibrium system such as 

al = 2 -a 2 v . (2-45) 
These are related to the central line-of-sight velocity dispersion, 20 ) a\ os : 

°2»(0) = f <rg = \*l (2-46) 
They are also related to X-ray observables 64 ) by 

°l = \°l = , (2-47) 

where the isothermal (3 model is used for modelling the cluster mass distribution. 

The non-singular pseudo-elliptical lensing potential should be used with caution, 
because for \02\ ^> \0±\ 3> 0q the mass-isodensity contours are convex only if |e| < 1/5. 
For e > 1/5, the mass-isodensity has a complex shape, something like a dumbbell. 21 ) 

The ellipticity of the mass distribution of a self-gravitating non-rotating colli- 
sionless system (which is likely to be the case for cluster of galaxies) is limited by the 
dynamical instability, 22 ** and the axis ratio must be smaller than a/b ~ 2.5. Thus we 
expect that in many cases the elliptical potential cannot be a good representation of 
a physically motivated mass distribution. 

2.2.4. Elliptical mass distribution 

In this subsection, we first consider the lensing due to an oblate spheroidal as an 
example of a 3D ellipsoidal mass distribution of the lens. 23 ) The density distribution 
of the oblate spheroidal is described by p(a), where a is defined by 



z 



12 



a 2 = x' 2 + y' 2 + ^ for 1 > e > 0. (2-48) 

1 — e z 

When the symmetry axis z' titled relative to the line of sight by an angle 7, the 2D 
surface mass density is described by 



V 1 - e 2 sin 2 7 A 2 Va 2 - b 2 

where r cut is the cut off radius of the cluster mass distribution in the direction of the 
major axis of the oblate spheroid, Xi and yi are coordinates in the lens plane, which 
is perpendicular to the line of sight, and b is defined by 

fr 2 = * 2 + ^ ^7-5-. (2-50) 

1 1 - e 2 sm 2 7 v ' 
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The complex formulation of the lensing equation is convenient in this case, and the 
lensing equation is written as 

Z S = Z--I*(Z), (2-51) 

7T 

where Z s = x s + iy s is the source position, Z = Xi + iyi is the image position in 
complex form, and I* is the complex conjugate of the scattering function, I(Z), 
which is defined by 

^ f, — 12~^: / ,min(rcut ' 6) dh , b'K(b') 



dx' l dy l l „(b') Y - Yl = 2n^^l - e 2sin2 7 ^ 



\/ Z 2 — 6 /2 e 2 sin 2 7 
(2-52) 

A more detailed derivation of this equation is given by Asano and Fukuyama. 24 ) 
This result shows that only the mass within the isodensity contour at the position 
of the image, b, contributes on the lensing equation. 24 ) 

For example, an image positioned on the minor axis will be sensitive to the mass 
within an ellipse and thus with a larger area compared to the circular symmetric 
case. Therefore, an elliptical lens need not be super critical to have multiple images 
(see the next section). 

We now discuss the properties of the pseudo-isothermal elliptical mass distribu- 
tion, since it is one of the most useful elliptical surface density profiles, and has been 
extensively applied to the simple mass estimate of lens cluster. 25 ) The convergence 
of the pseudo-isothermal elliptical mass profile is given by 

«(&) = -^=, (2-53) 

where kq is the convergence at the centre, and bo is the core radius. One obtains the 
scattering function / from Eq. (2-52) for Eq. (2-53) with r cut — ► oo: 

. /l-e 2 sin 2 7l Uyfift + ^ sin 2 7 + - 6 2 e 2 sin 2 j 

I(Z) = 2ttkq9 c —-\ In = 

tZ V e 2 sm 7 [ ^ c ^e 2 sin 2 7-v^ 

(2-54) 

From the scattering function we can then easily compute the lensing properties of 
this mass distribution. Although this mass distribution does not have a finite total 
mass, a truncated version of that model can be constructed (see Kneib et al. 32 )). 
The lens properties of the pseudo-isothermal elliptical mass profile are similar to 
those of the pseudo-elliptical lensing potential, in particular the behavior of image 
configurations. 25 ) However, there are several qualitative differences between these 
two. A pseudo-isothermal elliptical mass distribution can have naked cusps for any 
value of the central convergence or the core radius, whereas a physically meaningful 
pseudo elliptical lens potential can only have naked cusps when kq < 5. Since the 
convergence is given in the pseudo-isothermal elliptical mass profile, the contour of 
the isodensity is convex for all values of the ellipticity, and the surface density is 
always positive (which is not always the case for an elliptical potential). 
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2.3. Multiple images and time delay: Folds, cusps, lips and beak-to-beak caustics 

When a light ray propagates through a deep gravitational field, the arrival time 
of the light to an observer is delayed by two effects: i) a geometrical effect and 
ii) a gravitational effect. The geometrical delay is explained by the deflection of 
the light by the gravitational field. The gravitational delay is due to the fact that 
a deep gravitational potential is equivalent to a medium with a refractive index 
(1 + 2(j)/c 2 )~ 1 , which is larger than 1 (</> is the 3D gravitational potential defined so 
that at infinity 0=0). Hence, the effective speed of light propagating through a deep 
gravitational potential is "slower" and results in a delay of the arrival time. 

For a source at position (5 in the source plane, the time delay at position in 
the image plane is given by 26 ) 



t = (i±*)AiA 



D, 



'ds 



(2-55) 



where z& is the redshift of the lens, D S1 and D& s are angular diameter distances 
of the observer to the source, the observer to the lens, and the lens to the source, 
respectively, and ip{6) is the effective gravitational lensing potential, which corre- 
sponds to the gravitational time delay. The effective gravitational lensing potential 
is defined by Eq. (2-4), and its amplitude depends both on the mass distribution of 
the lensing object and the distance to the source. The term \{6 — (3) 2 corresponds to 
the geometrical effect. Fermat's Principle states that a light ray follows a trajectory 
such that the light-travel time is stationary relative to neighboring trajectories. In 
other words, the images are located at those points where the time-delay surface t{9) 
is stationary: Vet(6) = 0. This defines the lensing equation (2-1), where Vg is the 
2D gradient by 6. 

The number of stationary points in the time-delay surface corresponds to the 
number of expected images. This number depends on the source position in the 
sky, the lensing parameter, which depends on the redshift of the lens and the source 
and depth and shape of the lens potential, and (,!?o, Aq) through angular diameter 
distances. The behavior of the time-delay surface is described well by the catas- 
trophe theory. 27 ) In the case of lensing theory, the number of independent control 
parameters is three, e.g., two parameters for specifying the source position in the 
sky and one lensing parameter. Therefore, elementary catastrophes that play a role 
in lensing are folds, cusps, swallow tails, hyperbolic umbilics and elliptic umbilics. 

For a circular symmetric lens, the nature of the time-delay potential depends 
only on the radial distance of the source position from the lens centre. In other words, 
the number of the control parameters is one. Therefore, Tom's catastrophe theorem 
implies that only the fold catastrophe appears in a circular symmetric lens (Fig. 
2). For an elliptical lens, the time-delay potential depends on the two-dimensional 
positions of the source, and hence the number of control parameters is two. There- 
fore, the cusp and fold catastrophe can appear in this case (Fig. 3). When a source 
crosses a fold caustic from inside to the outside, two images with opposite parity will 
become closer, then merges at the location of the critical line and finally disappear, 
when the source moves outside of the caustics. When a source crosses a cusp caustic 
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Fig. 2. The caustics and critical lines of a circular lens with a finite core. Left and right panels 
describe the caustics in the source plane and the critical lines in the image plane, respectively. A 
tangential (radial) caustic is a central point in the source plane, and the corresponding critical 
line is a outer circle in the image plane. The appearance of the images corresponding to each 
source position are also shown. 

from the inside to the outside, three images with opposite parity will become closer, 
and then merge at the location of the critical line, and only one image with the same 
parity as the original source parity remains when the source moves outside of the 
caustics. If a cusp caustic lies outside the radial caustic as described in Fig. 3, such 
a cusp is termed "naked cusp." If a source lies close to a naked cusp, one or three 
highly magnified images appear without any additional images. The first discovered 
giant arc in A370 is likely to be a highly magnified image due to a naked cusp (Fig. 
9). 

For simple gravitational lenses (with one main clump) we can distinguish be- 
tween two types of caustics. Tangential caustics occur when the deformation of the 
merging images is tangential (nearly parallel to the critical line). Radial caustics 
occur when the deformation of the merging images is radial (nearly perpendicular 
to the critical line). 

Detection of a radial arc introduces a strong constraint on the degree of the 
central mass concentration in the cluster. For example, in the case of a non-singular 
mass distribution (finite core) we expect a radially stretched image. This is clearly 
seen in the case of a circular symmetric lens (Fig. 2). When the source approaches 
the radial caustics, two images approach each other across a radial critical line along 
the radial direction and merge into a radially stretched single image. If the den- 
sity distribution of the lensing object in the central region can be approximated by 
p oc r~ 7 and 7 < 2, the gravitational potential and the refractive index are finite at 
the centre of the lens. Hence, the gravitational delay is finite, and a light ray can find 
its way through the centre of the lens. On the other hand, if 7 > 2, the gravitational 
potential is singular ( e.g. <p{r) — > — 00 when \ f\ — > 0) and the gravitational delay 
becomes infinite at the centre. Therefore, a light ray which passes near the centre 
never arrives to the observer. For example, a point mass lens and a singular isother- 
mal sphere do not have radial caustics and will produce two multiple image. 8 ) " 14 ) 
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Fig. 3. The caustics and critical lines of an elliptical lens. Left and right panels describe the caustics 
in the source plane and the critical lines in the image plane, respectively. Four cusp caustics 
appear at the corners of diamond caustics. The other parts of the caustics are folds. The 
appearance of the images corresponding to each source position are also shown. The minor axis 
of the elliptical lens potential is the vertical direction. The ellipticity of the lens potential is 
larger in the case shown in bottom figures than in the others. Naked cusps appear. When a 
source lies on a naked cusp, one highly magnified merged image appears without any additional 
images. 
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On the other hand, the NFW density profile may have a radial arc 18 ) since it has 
7 = 1 in the central region. It has been sometimes said that the detection of a radial 
arc is evidence of the finite core radius of the cluster mass distribution. n ) However, 
the only conclusion that can be drawn from the detection of the radial arc is that 
the density profile of the cluster central region is flatter than p oc r~ 2 , e.g. 7 < 2. 



Fig. 4. The time delay surface for a circular lens with j3 = (left panel) and /3 7^ (right panel), 
where the source position is shifted to negative {/-direction. A contour line corresponding to the 
same height as a saddle point of the time delay surface is also drawn. Each stationary point 
is named according to their second derivatives: H(high), S(saddle) and L(low) denote the local 
maximum, the saddle and the local minimum points, respectively. The isochrone appearing in 
the right panel is named "limacon" . 

The time-delay surface helps us to visualize image configurations for the complex 
lens model. As we can see from Eq. (2-55), the geometrical delay surface is a concave 
paraboloid, and the gravitational delay adds a bump to the paraboloid. For a circular 
symmetric lens with a finite core radius and a perfect alignment source-lens-observer 
(Fig. 4), (3 = 0, and the time-delay surface will display one maximum (H) and one 
circle as minimum (L). The circle L corresponds to the Einstein ring. For small 
but finite values of /?, the Einstein ring disrupts into one saddle point (S) and one 
minimum. The total number of the stationary points (hence the number of images) 
is 3. The total parity of each stationary point is even for H and L images, and odd 
for S image. If we increase /?, H and S will merge (at the critical line) along the radial 
direction forming the radial arc, and then disappear, and only the L image remains. 
Note that if the central potential depth is infinite, there is no maximum H and no 
radial arc can exist. For an elliptical potential and in the perfect alignment case, 
the ring L disrupts into two Ss (along the major axis of the potential) and two Ls 
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Fig. 5. The time delay surface for an elliptical lens with (3 = (top left panel) and /3 X > (/?„ = 0) 
(top right and bottom panels). Contour lines corresponding to the same height as a saddle point 
of the time delay surface and the name of each stationary point are also indicated. In the top 
two panels, two saddle points appear. Shifting the source position further from the top right 
panel, two Ls and one S merge into one L, and one obtains the case described by bottom panel. 
The top right panel has both a limagon and a lemniscate as isochrones. 
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(along the minor axis of the potential), because the gradient of the effective lensing 
potential is different along major and minor axes (Fig. 5). First we consider the 
case in which a cusp lies inside of the radial caustic. If we shift the source position 
from perfect alignment along the major axis, two Ls and one S will move closer along 
a curved trajectory tangential to the lens potential contour and merge, forming a 
tangential arc. The position of the source at this stage lies at the cusp caustics. 
If we further shift the source position, the merged image converges into a single L 
image, and H and S approach along the radial direction and merge into one radial 
arc, where the source is on the radial caustics. Further outward shifting of the source 
position causes H and S to disappear, and only the L image remains. 

In the case of a naked cusp, first H and S approach along the radial direction 
and merge into one radial arc, where the source is on the radial caustic, and finally 
they disappear if we shift the source position from the perfect alignment along the 
major axis. If we further shift the source position, two Ls and one S become closer 
and merge, forming a tangential arc. The position of the source at this stage lies at 
the naked cusp caustics. The elliptical lens can have five or three or one images, and 
the total number of images always changes by ±2 when the source crosses a caustic. 

In the case of an elliptical lens, one can see another type of important caustic 
for cluster lensing, that is lips caustics (Fig. 6). We start with the perfect alignment 
case (Fig. 5). As decreasing the depth of the effective lensing potential, a gradient 
along the major axis is getting flatter. Finally, two Ss disappear and a central H 
changes into S. Such caustics are called 'lips caustics' because of their shape. 10 ) The 
time delay surface and shape of the lips caustics are drawn in Fig. 6. If an extended 
source fills in the lips caustics, one S and two L images merge into one straight arc. 
If we further decrease the depth of the effective lensing potential, one S and two Ls 
merge and disappear, and only one L remains. Since the Laplacian of the time-delay 
potential reads 

At{9) oc 2 - AiP0) = 2(1 - k0)), (2-56) 

the condition of the maximum point, At(9) < 0, reads k{9) > 1. Therefore, to have 
an H image, the surface mass density of the lens must be super critical, and the 
lens must be super critical for the circular symmetric lens to be able to have strong 
lensing events and for the elliptical lens to have five images. However, in the case of 
lips caustics, there is no H, and the lens does not have to be super critical. These 
diagnostics of the lips shows that the lips caustics appear when the surface mass 
density of the lens is marginally critical. 

The beak-to-beak catastrophe is another example of a marginally critical lens. 10 ) 
It is formed of two marginally critical lenses close to each other in the same lens 
plane. In Fig. 7 the behavior of time delay surfaces for the observer at the middle 
of two circular lenses in the case that the strengths of two equal lenses are similar, is 
shown. Decreasing the strength of the lens (that is, decreasing the absolute value of 
the amplitude of the effective lensing potential) corresponds to putting the source at 
lower redshift for a fixed lens mass distribution and fixed cosmological parameters. 
In the case shown in Fig. 7, two L images and one S image appear along the line 
perpendicular to the line across the two lenses when the source is placed in the 
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Fig. 6. The lips caustic. The time delay surface for the marginally critical elliptical lens with (3 = 
is shown in the top panel. The major axis of the elliptical lens potential is the x-direction. 
The bottom left and right panels show the caustic and critical line in this case, respectively. 
This caustic is called a "lips caustic" because of its shape. The appearance of the images 
corresponding to each source position are also shown. If the central source size is larger and 
touches the fold caustic line, three multiply lensed images merge into one straight arc. Even if 
the source appears on the cusp, the curvature radius of the arc is large, and a nearly straight 
arc is obtained. 



middle of the two lenses. When decreasing the strength of the lenses, two Ls and 
one S approach and merges, and only one L image remains. The time-delay surface 
and caustic for this case are shown in Fig. 7. These are termed a 'beak-to-beak 
caustic' Both lips caustics and beak-to-beak caustics can form straight arcs. 28 )' 29 ) 
Lips and beak-to-beaks are not listed as elementary catastrophes. The reason for this 
is that these are the cross sections by the source plane of the cusp ridges associated 
with any of the elementary catastrophes in a control space of greater than three 
dimensions. This situation is illustrated well by Fig. 6.12 in Schneider, Ehlers and 
Falco. 10 ) 

As mentioned at the beginning of this section, three more higher order elemen- 
tary catastrophes appear, but they always are a combination of cups and folds. 
Readers can consult the textbook by Schneider, Ehlers and Falco 10 ) with regard to 
the application of the higher order catastrophe for the lensing. 
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Fig. 7. The beak-to-beak caustic. Two critical circular lenses are put on the same lens plane with a 
small separation. The left top and left middle panels are caustics, and the right ones are critical 
lines, respectively. The strengths of the lenses are weaker; that is, the source is closer to the lens 
in the case shown in the top panels. As the source gets closer to the lens, two fold caustic lines 
appearing in the central region merge and finally disappear (changes from the middle panel to 
the top panel). The caustics shown in the top panel are called "beak-to-beak" caustics, owing 
to their shape. The appearence of the images corresponding to each source position are also 
shown. A highly stretched straight arc is obtained when a source exists at the middle of two 
lenses. The time delay surfaces and their isochrones for the lens system corresponding to the 
middle panel are shown in the bottom panels. The left panel is shown for the case when the 
source, the observer and the middle point of the two lenses are aligned. The case in which the 
source position is slightly shifted toward the centre of one of the lenses is shown in the right 
panel. 
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2.3.1. The strong lensing cross section 

For statistical studies of strong lensing events, it is important to compute the 
lensing cross-section: the probability to observe the image with a magnification larger 
than a threshold value [i. The computation of the cross-section near fold caustics 8 ) 
gives a good first order approximation of the total cross section. Indeed, most of 
caustics are folds, and the contribution to the total cross section from a cusp is 
generally negligible, since they are points. Without loss of generality, the origin of 
the coordinate system in the source plane can be put on a fold caustic, and the 
direction of the x and y coordinates can be defined as tangential and perpendicular 
to the fold caustic, respectively. As long as we are interested in a local estimate, the 
x-coordinate can be approximated as the caustic line, and the magnification then 
only depends on the y-coordinate, the distance to the caustic line. 

Thus, the cross section near the fold caustics is given by 



<t(> y) = f J D sdPx dP y , (2-57) 



where P y (n) is the position where the magnification reaches \x and the integral over 
(3 X is taken along the fold caustic. This equation can be easily rewritten as 

r r e vM „ 

<?{> n) = j J D 2 s H(9)de x al6 y , (2-58) 

where we have used the fact that the caustic line is mapped onto the critical line. 
To avoid double counting the area, we integrate only on one side of the critical line, 
although two images with approximately the same magnification appear in both 
sides of the critical lines. Since the magnification is infinite at the critical line, the 
Hessian H(0) is zero on the critical line. Thus the derivatives of the Hessian along 
the critical line are also null, in particular, dH(0)/d9 x = 0. Therefore, the Taylor 
expansion of the Hessian around the critical line is 



dH(6) 

nyuj = 

Then 9 y (fi) is defined by /i as 



H(e) = ~de~ 0y - (2 ' 59) 



8m w 



90y 



-1 

(2-60) 



Finally we obtain 



where d9\\ is the length element along the critical line and |Vj_i^|^ = o is the gradient 
of the Hessian in the direction of the perpendicular to the critical line evaluated on 
the critical line, and to first order it does not depend on fi. We thus conclude that 
the total cross section to observe lensed images with magnification larger than n is 
proportional to fj,~ 2 . 
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§3. Constraining the mass distribution of clusters 

3.1. Modelling the mass distribution with strong lensing 

Strong lensing events, such as GLAs 
and multiply imaged arcs (out-standing 
example is shown in Fig. 8) , constitute a 
powerful tool to constrain the mass dis- 
tribution of cluster cores. As we have 
shown in the previous section, the sim- 
plest way of estimating the mass of a 
cluster with strong lensing events is as- 
suming circular symmetry for the lens 
and to use Eq. (2-19), where the Ein- 
stein radius 6e is estimated from the po- 
sition or curvature of the arc or from the 
distance between multiple images. Such 
an estimate is, however, very rough and 
can easily over or underestimate the ef- 
fective projected mass in the core. 

More accurate modelling of the clus- 
ter mass distributions are based on mul- 
tiple images. The idea here is to op- 
timize the cluster mass model in order 
to reproduce the observed sets of multi- 
ple images within the observational un- 
certainty. A x 2 can be defined by the 
quadratic sum of the difference of the 
central positions of each image mapped 
onto the source plane divided by the observational uncertainty scaled by the lens 
mapping. Alternatively, it can be defined as the quadratic sum of the difference be- 
tween the actual observed images and the images of a fiducial source. The elongation 
and orientation of each image can in principle be added to the optimization proce- 
dure. However, due to the relatively large observational uncertainty compared to the 
image position, they generally impose a loose constraint. Since the number of avail- 
able observational constraints is not large, the number of parameters describing the 
cluster mass distribution should be of the same order as the number of constraints. 
For this reason, the mass model is often described by a linear sum of mass clumps 
described analytically. One of the first best known examples of a cluster mass model 
is the model of A370 by Kneib, Mellier, Fort and Mathez. 30 ) Their model is very 
simple in the sense that it has only two mass clumps centred on the two giant ellip- 
ticals of that cluster, and was constrained by the giant arc (a triple image) and an 
other triple image, B2-B3-B4. From their model they could predict the redshift for 
the source of B2 to be z ~ 0.865. Figure 9 shows the central part of A370 seen with 
fTST/WFPC2 in F675W, with the improved mass distribution model by Bezecourt 



Fig. 8. The HST/WFPC2 image of CL0024 
+ 1654 at z = 0.39. One can clearly see 
that one galaxy is multiply imaged, 7 im- 
ages in this case. The change of the parity 
and blue nature of the lensed images can 
be observed. 
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Fig. 9. The central part of A370 taken by HST/WFPC2 image in F675W the top panel. 31) Overlaid 
is the mass distribution (black thin lines). The most important arclets and multiple images 
are colored. The radial arc can be observed in the south clump. The contour map of the 
X-ray emission obtained by ROSAT/YLKl is shown in the bottom panel. This is taken from 
Hashimotodani. 44 - ) The crosses indicate the positions of the two cD galaxies. There is a small 
cave close to the northern cD. North is up in both figures. 
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et al. 31 ) that extends the former model including cluster galaxies (following the pre- 
scriptions of Kneib et al. 32 ) The latest model of A370 is based on two sets of double 
images and four sets of triple images with a spectroscopic redshift for two of them. 
The fact that the source redshift of B2 was first predicted by Kneib et al. 30 ) and 
then confirmed spectroscopically by Bezecourt et al., 31 ) strengthens support for the 
reliability of the model. 

Cluster-lenses mass models have been considerably improved with the refurbish- 
ment of the Hubble Space Telescope (HST). Indeed, the high image-resolution of the 
ffi>T/WFPC2 camera on a reasonably wide field makes it a unique instrument to 
probe the mass distribution of cluster cores. As was first shown by Kneib et al., 32 ) 
thanks to the identification of a larger number of multiple images, it is possible to 
constrain the mass distribution on the scale from ~500 kpc down to the galaxy scale 
~10 kpc. Strong lensing events do constrain the mass distribution of the cluster 
core, and more specially the mass-to-light ratio of cluster galaxies. Furthermore, by 
measuring the deviation of the shear field from the prediction of the cluster poten- 
tial which is constrained by strong lensing events, the amplitude of the perturbations 
due to the cluster member galaxies can be evaluated. 33 ) Therefore, the mass-to-light 
ratio of the cluster member galaxies can be constrained. Such an approach combined 
with the weak (galaxy-galaxy) lensing approach can measure even more accurately 
the masses of galaxies. 33 )' 34 ) 

3.2. Comparison with X-ray results 

Clusters of galaxies contain huge amounts of hot gas, which emit X-ray through 
bremsstrahlung. This hot gas constitutes the intra-cluster medium (ICM). Since the 
sound crossing time of the ICM through the cluster core is ~ 10 8 yr and generally is 
much less than the age of the cluster (~ several x 10 9 yr), the ICM could be described 
well by hydrostatic equilibrium. Therefore, if the gas density distribution and the hot 
gas temperature distribution are measured, the mass distribution of the cluster can 
be obtained by solving the equation of hydrostatic equilibrium. The ROSAT/HRI 35 ) 
had a good spatial resolution in 0.1-2keV and has provided precise information on 
the distribution of the ICM. The ASCA 37 ^ satellite has a good spectral resolution in 
the 0.5-10keV band and is the first instrument able to measure the ICM temperature 
in the distant lensing cluster (despite a poor spatial resolution). We were very lucky 
to have both complementary X-ray satellite past five years, while the lensing study of 
the cluster mass distribution was emerging as a new powerful tool to constrain cluster 
mass distributions. Lensing and X-ray studies of the cluster mass distributions 
are complementary. While X-ray studies of cluster mass distributions require some 
assumptions on the thermal and dynamical state of the ICM, lensing directly maps 
the surface mass density of the cluster, regardless of its dynamical and thermal state. 
Therefore, lensing studies of cluster mass distributions are more reliable than X-rays 
if sufficiently accurate lensing constraints are available. Comparison of the two 
results provides information on the thermal and dynamical states of clusters. Such 
a comparison may also shed new light on the formation and evolution of clusters 
which tightly links them to modern cosmological interests. 

Peebles 38 ) wrote in 1993, "[...] we are going to extrapolate the physics that 
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is known to be successful until it is seen to fail. [...] the extrapolation out in 
space and back in time is by no means without empirical support. [...] gravita- 
tional lensing of background galaxies by the mass concentrations in clusters of galax- 
ies, as analyzed in general relativity theory, is consistent with the masses derived 
from the motions of the galaxies and from the plasma pressure within the clus- 
ters. The relevant length scale here — the impact parameter at the center — is 
ten orders of magnitude larger than that of the precision tests of general relativ- 
ity in the Solar System and in binary pulsar systems, a remarkable extrapolation. " 
However, soon after the publication of 
Peebles' textbook, it was claimed that 
the mass in the cluster core region esti- 
mated from strong lensing effects is sys- 
tematically a few times larger than the 
mass estimated by X-ray observations 
under the assumption of the hydrostatic 
equilibrium of the ICM. 39 ) " 43 ) The sim- 
ple comparison discussed in these pa- 
pers has created a great deal of excite- 
ment and discussion, in order to try to 
explain such discrepancy. Hashimoto- 
dani 44 ) showed that the discrepancy be- 
tween the lensing and X-ray approaches 
seems to be systematic on a large sam- 
ple of 30 distant cluster lenses (Fig. 
10). Such analyses on such large sam- 
ples was first made by Wu and Fang, 41 ) 
but their X-ray mass estimates were not 
an accurate, since they were not using 
ROSAT/RRl and ASCA data. Allen 
45 ) also analyzed the ROSAT/BRI and 
ASCA data of 13 distant cluster lenses 
and obtained the first reliable mass es- 
timations based on X-ray observations, 
and has made the first robust confirma- 
tion of the existence of the systematic 
discrepancy. 



Fig. 10. The comparison of the cluster mass 
obtained by X-ray observations with those 
obtained by modelling of strong lensing 
events. The lensing masses are quoted from 
the literature, mostly from Wu and Fang. 
41 - ) This figure is quoted from Hashimoto- 
dani. 44 > 



3.3. Understanding the mass discrepancy 

In this section we introduce ideas proposed to solve the mass discrepancy. We 
also attempt to discuss the best current understanding of the problem. 

3.3.1. Non-thermal pressure 

Loeb and Mao 39 ) proposed that non-thermal pressure induced by equipartition 
turbulent and magnetic pressure plays a crucial role in supporting the ICM against 
the cluster gravitational pull. Therefore, if one neglects the contribution of these 
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non-thermal pressures, one could easily underestimate the gravitational mass of the 
cluster from the ICM distribution by a factor of 2-3. They concluded that the ob- 
served high Faraday rotation measures (hereafter RM) in the directions of radio lobes 
in several (low-z) clusters would be consistent with the required equipartition mag- 
netic field needed in lensing clusters to resolve the discrepancy However, Makino 46 ) 
pointed out that the magnetic field strength estimated from the RM in the direction 
of background radio sources is an order of magnitude less than the equipartition 
magnetic field strength. He argued that the extremely high RM obtained in the 
direction of the radio lobe is likely to be intrinsic to the radio lobe but not to the 
ICM. The magnetic field strength in cluster lenses is undetermined at present. It 
is, however, constrained by hard X-ray and radio observations. Indeed, extended 
radio halo caused by synchrotron radiation has been detected from several nearby 
clusters. 47 ) The relativistic electrons in the cluster, which constitute the source of 
the synchrotron radiation, produce a hard X-ray tail in the cluster X-ray spectrum 
by inverse Compton scattering of the cosmic microwave background radiation. 48 ) It 
is expected that the next generation of X-ray satellites (Astro-E) will judge the pos- 
sibility of the equipartition magnetic field in the ICM by measuring the hard X-ray 
tail. 

3.3.2. Elongation of the cluster and 3D mass distribution 

Loeb and Mao 39 ) pointed out that the cluster elongation along the line of sight 
is not a plausible explanation since it would require a large axial ratio that would 
cause a dynamical instability. 22 ' Furthermore, it is very unlikely that all the lensing 
clusters are elongated along the line of sight by accident. However, the possibility of 
the superposition of several clusters along the line of sight could easily explain the 
discrepancy observed for some optically selected lensing cluster. For example, the 
deep wide field redshift survey of C10024+1654 49 )' 5 °) seems to favor the existence of 
the superposition of two close-by clusters. 

3.3.3. Cooling flow 

Clusters of galaxies are categorized into two groups according to the nature of the 
distribution of the ICM in the cluster core. The clusters in which the X-ray emission 
has a prominent central peak that coincides with the center of a bright elliptical 
galaxy is called XD. Clusters in which the X-ray emission is dispersed and does not 
exhibit a clear coincidence between the X-ray peak and the bright galaxy center is 
called non- XD. 51 ) The XD clusters are believed to harbour a cooling flow. > 

The multi-phase nature of the ICM in the central part of clusters due to cooling 
flow has been considered as a possible source of the underestimation of the X-ray 
cluster mass. 45 -*' 53 ) Allen 45 ) studied X-ray data of 13 lensing clusters of galaxies 
and broke his sample into three groups, on their central cooling time: cooling flows, 
non-cooling flows, and intermediate samples. He showed that the mass discrepancy 
is significantly reduced for cooling flow samples by applying the cooling flow model 
to the spectral fitting. If the cooler components induced by cooling flow exists, 
the temperature of the hottest component should become higher by a factor of 2- 
3 than the temperature obtained by a single phase model fitting of the observed 
emissivity-weighted temperature. Since the hydrostatic mass depends linearly on the 
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temperature of the hottest phase, it increases the X-ray mass. He also noticed that 
the centre of the X-ray emission is significantly shifted from the centres of curvature 
of the lensed arc for the non-cooling flow sample. Therefore, these clusters may 
not be in dynamical equilibrium and probably are post-mergers. However, several 
arguments do not support this idea. ASCA data of nearby XD clusters are fitted well 
by a two-temperature phase model in the core and with a single phase model in the 
outer region. 54 ) The hotter phase stays isothermal throughout the cluster, and the 
weighted averaged temperature throughout the cluster is not so much different from 
the temperature of the hotter phase; this seems to be contradictory to Allen's 45 ' 
analysis. The distant cluster RXJ1347-1145 is the most X-ray luminous cluster ever 
discovered and is thought to have an extremely massive cooling flow, according to 
Allen. 45 ) Recently a radial profile of the Sunyaev-Zel'dovich (hereafter SZ) increment 
was measured by JCMT/SCUBA 55 ) for this cluster. It was shown that the combined 
fitting 56 ' to the X-ray spectrum obtained by ASCA and to the SZ radial profile puts 
a strong constraint on the strength of the possible cooling flow and leads us to reject 
the massive cooling flow model proposed by Allen. 45 ) 

Clearly, deeper insight here is needed to improve our knowledge on the cooling 
flow model. The next generation of X-ray satellites, like AXAF/XMM with their 
spatially high-resolution temperature measurement (of cluster-lenses) and Astro-E 
with its high resolution X-ray line spectroscopy (of nearby cooling flow candidate 
clusters), and on-going (e.g. JCMT/SCUBA) and future projects (e.g. LSA/MMA, 
LMSA) of mm and submm observations of SZ and dust emission from clusters, 57 ) 
will soon help us in understanding the physical properties of the ICM in a cooling 
flow cluster. 

3.3.4. ICM not in dynamical equilibrium 

Allen 45 ' also suggests that for non-cooling flow cluster, the main reason for the 
discrepancy is that these clusters are not in dynamical equilibrium, as they are post- 
mergers. The evidence given to support this idea (see also Hashimotodani 44 ) ) is that 
the centre of the X-ray emission is significantly shifted from the centres of curvature 
of the lensed arc. Such a possibility was first proposed by Kneib et al., 58 ) in their 
detailed mass distribution model of A2218 based on lensing data and compared with 
the ROSAT/HR1 X-ray map. The lensing model predicts a bimodal distribution 
centered on the two brightest cluster galaxies. The distribution of the X-ray emitting 
gas is, however, not consistent with the lensing mass model: the X-ray bright core 
is elongated perpendicular to the elongation of the lens model, with a shift in the 
position. Moreover, there is no secondary peak in the X-ray emission associated with 
the secondary clump inferred from the lens model. Thus, they suggested that the 
ICM in A2218 is highly disturbed by the passage of a subcluster, and the ICM is no 
longer in hydrostatic equilibrium in the cluster core. 

This possibility is also supported by other evidence. The left-hand side of Eq. 
(2-47) is (830km/sec) 2 = 6.9 x 10 15 cm 2 /sec 2 for the main clump model presented in 
Kneib et al. 58) The best fitting values to the X-ray data yields (/3/0.62)(A;BT/7.04keV) 
= 6.6 x 10 15 cm 2 /sec 2 for the right-hand side of Eq. (2-47). These two values are 
consistent within X-ray measurement errors for temperature 7.04 ± 0.07keV 59 ) and 
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for slope parameter of X-ray surface brightness f3 = 0.62^Qgg 44 ) where errors are 
90% one parameter errors. The main discrepancy in this cluster can be explained 
by the large X-ray core radius of 53", 60 )> 61 ) compared to the value in the lensing 
model, ~ 20". A simple check of this can be done in the following way. Inserting 
d c = 53", <9 E = 23" and z s = 2.6 into Eq. (2-36) yields k B T = 17keV, while C = 20" 
yields k B T = 9.5keV, which is close to the measured temperature. If the post-merger 
causes the cluster central gas to disperse and to thus forces it away from equilib- 
rium, but allows for the dark matter distribution to remain centrally concentrated 
(as suggested by lensing), one should use a core radius of ~ 50 — lOO/i^kpc (about 
20") for the dark matter distribution (which is the typical X-ray core radius for an 
XD cluster). Then, assuming that the gas distribution at the cluster outskirts is in 
hydrostatic equilibrium, where the observed (3 value and the temperature are mea- 
sured, the mass discrepancy problem for A2218 is solved. The last assumption may 
be safely applied, since the mass of the subclump supposed to have collided is much 
smaller than the mass of the main clump. 58 ) 

3.3.5. Effects of complex mass distribution 

Recent studies have revealed other plausible possibilities to reconcile the discrep- 
ancy. The main reason here is overestimation in the lensing mass determinations. 
By definition, a large number of galaxies populates the cluster core. Therefore, we 
cannot neglect the contribution of their stellar mass and their dark halos. With such 
complex models, the total mass needed to produce arcs is significantly decreased 
compared to smoothed and symmetric models. 



Fig. 11. The central part of cluster A2390 seen with JTST/WFPC2/F814W. A-B-C is a straight 
arc. It is in fact composed of two objects: B-C at z = 0.913 and A at z = 1.03. The objects 
H3a-b and H5a-b are two sets of multiple images with z = 4.06. ln ' 

For example, Abell 2390 has a "straight" arc. 2S ) It requires a special geometry 
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for the mass distribution of the lensing object and imposes a strong constraint on 
the cluster mass distribution (Fig. 11). Pierre et al. 62 ) presented a detailed analysis 
of the ROSAT/HRl image and found an extended local sub-peak of X-ray emission 
which might be associated with the existence of a dark matter clump. This clump 
would act as a perturber, just as predicted by Pello et al., 28 ) to reproduce the straight 
arc. In fact Pierre et al. 62 ) successfully constructed a lens model of A2390 defined 
by two mass clumps. Bohringer et al. 63 ) measured the temperature of this cluster 
and studied its X-ray data, which exhibits a very smooth, centrally peaked X-ray 
emission, and evidence of massive undisturbed cooling flow, indicating that the ICM 
is in hydrostatic equilibrium in the cluster potential. They showed that the X-ray 
mass estimation assuming that the cooling flow has little effect on the temperature 
measurement agrees well with the lensing mass inferred by Pierre et al. 62 ) If the 
lensing mass of this cluster is estimated with the formula for the circular symmetric 
lens, the estimated mass is several times larger than the X-ray mass. These results 
indicate that the introduction of a small asymmetry in the mass distribution of 
the cluster core can substantially reduce the lensing mass compared to the circular 
symmetric approximation. 

Hattori et al. 64 ) showed that in the 
distant cooling flow cluster CL2236-04, 
lensing and X-ray masses agree if the 
lensing mass model includes a second 
clump of mass centered on the second 
brightest member (Fig. 12). This suc- 
cessful lens model was first proposed by 
Kneib et al. 65 ^ prior to any X-ray obser- 
vations. 

Similarly, Hashimotodani et al. 67 ) 
have shown that there is no discrep- 
ancy if a secondary potential due to the 
bright cluster galaxy near the arcs of 
MS0302. 7+1658 and MS1621. 5+2640 is 
included in the lensing model. If the 
contributions of the secondary clump 
are not taken into account, the X-ray 
mass of the clusters are a factor of 2-4 
less than the required mass to explain 
the arc positions in these two clusters. 
Therefore, the introduction of the sec- 
ondary galaxy potential is essential. 

In the case of the above mentioned 
clusters, the nature of the galaxy poten- 
tials are constrained consistently with 
their optical properties. The strong 
lensing in CL2236-04 and MS0302. 7+1658 is an example of marginal lensing (lips in 
CL2236-04 and beak-to-beak in MS0302.7+1658). 



Fig. 12. The V+R image of CL2236-04 at z = 
0.55 taken by ESO 3.6m 66 - ) superposed on 
the X-ray image obtained by ROSAT/HRL 
64 - ) The straight arc at z = 1.16, as bright 
as the bright cluster member galaxies, can 
be observed at the north of the cluster core. 
A bright cluster member galaxy exists close 
to the arc. 
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3.4. A closer look at individual cases 

It seems that the discrepancy could be explained by different arguments. To get 
a better feeling of the current limitation on the data, we now describe in detail some 
peculiar cases. 

3.4.1. MS1621.5+2640 

The situation in MS1621. 5+2640 is typically a barely constrained cluster lens. 
The observed arc Al configuration can be reproduced by qualitatively different mass 
distribution models. 67 ) The first model assumes that the ICM is isothermal and in 
hydrostatic equilibrium. The second model assumes that the dark matter distribu- 
tion has a small core radius of ~ 50 kpc but is much smaller than the X-ray core 
radius of MS1621 (rf = 256 kpc). The last possibility was considered to take into 
account a possibility similar to A2218. As pointed out by Moris et al., 68 ) the large 
core radius and irregular morphology of MS1621. 5+2640 may be a signature of a 
disturbed distribution of the ICM due to merger. 

Arc Al can be explained by both models if a secondary galaxy contribution is 
taken into account. However, we cannot distinguish between models at this moment 
- mainly due to a lack of lensing constraints. Deeper lensing/X-ray observations 
will be important to understand the discrepancy. 

3.4.2. RXJ1347.5-1145 

The origin and history of the discrepancy for RXJ1347.5-1145 is somewhat dif- 
ferent from others. Schindler et al. 70 ) found two arcs in this field. Schindler et 
al. 72 -* made a lensing mass estimation assuming that they are multiple images, and 
found that the lensing mass is a factor of three larger than that expected from X-ray 
observations. However, it is now clear that they are single images of different back- 
ground galaxies, according to their color difference. 71 ) Therefore, the mass estimate 
made by assuming that the arc position is the same as the Einstein ring radius is 
not valid (and this is the case for other clusters). Fischer and Tyson 71 ) measured a 
weak lensing shear field and found that the mass obtained by a weak lensing mass 
reconstruction is a factor of 2 larger than that expected from X-ray observations. 72 ) 
However, as they do not have any multiple images, it is very difficult to believe 
an absolute mass estimate from weak shear estimates, as this will depend on the 
assumed redshift distribution of the background galaxies as well as the correction 
factor applied to compensate for the seeing effect. For example, if the mean source 
redshift of the shear field measured by Fischer and Tyson 71 ) through RXJ1347.5- 
1145 is as large as 1 (and not ~ 0.65, as they assumed) the required weak-shear mass 
is reduced by a factor of 2, and both the lensing and X-ray mass estimation provide 
consistent results. A photometric estimate of the source redshift may in the near 
future address such problems in more detail, especially with the new 8-10m class 
telescope, which offers deep high quality images in relatively short exposures. 

3.4.3. A370 

The cluster A370 is one of the best modeled lensing clusters. 30 )' 31 ) Ota, Mitsuda 
and Fukazawa 42 ) found a factor of 2 discrepancy in the cluster central mass estima- 
tion between X-ray and strong lensing. However, the assumed cluster center includes 
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very large error because of the poor ASCA position determination accuracy (~ 1'). 
Hashimotodani 44 ) analyzed ROSAT/HRl data and obtained interesting results (Fig. 
9). The X-ray emission is peaked at the center of the southern cD galaxy (see Fig. 
9). However, there is no X-ray peak associated with the northern cD, although the 
velocity dispersion and the core radius of the northern clump are similar to those 
of the southern clump in the lens model. 31 ) The existence of the northern mass 
clump has also been confirmed by the weak lensing mass reconstruction using the 
HST/ WFPC2 image, 7 ) been therefore making its existence certain. Hashimotodani 
revised the results of the ASCA temperature. It can be safely assumed that the 
temperature measured by ASCA is the temperature of the gas associated with the 
southern clump only. Checking the left- and right-hand sides of Eq. (2-47), they 
derived (3 and the temperature, and found them to be consistent with the velocity 
dispersion of the lens model. 31 ) The X-ray core radius of the southern clump is also 
consistent with the lens model. Therefore, for the southern clump, X-ray results 
are consistent with the mass distribution of the southern clump predicted by lens 
modelling. 

Thus, the problem now, is to explain why no X-ray emission is detected in the 
northern clump. Is this due to merger? May the gas around the northern clump 
and the southern clump just have merged? But then why did the southern clump 
not suffer from the merging? It is likely that A370 is the superposition of two 
clusters with slightly different redshifts, as suggested by their velocity histogramme. 
However, this is not a fully satisfactory explanation, as very few X-rays are detected 
in the Northern clump. 

Spatially resolved X-ray spectroscopy with AXAF and XMM observations and a 
deep wide field spectroscopy survey similar to the C10024+1654 survey will provide 
exciting new insight for understanding cluster formation. 

3.4.4. Is there really a discrepancy? 

It is now clear that comparison between strong lensing and the X-ray mass must 
be done carefully from cluster to cluster, based on the detailed X-ray image and 
galaxy distributions. However, we must note that, except for a few well studied 
clusters as introduced in this section, the lensing studies are sometimes not done 
very accurately — especially when they estimate the mass from the Einstein radius. 
Since the measurement of the mass of the cluster with strong lensing events is based 
on model fitting, if the model introduced is not motivated physically, it may lead to 
a large systematical error in mass estimation. 

Further work on the mass discrepancy between X-ray and lensing studies should 
consider seriously the complexity of the mass distribution (This is in particular true 
for the lensing estimate, but it also applies to X-ray analysis.) before reaching a 
hasty conclusion on an unlikely mass discrepancy. 

3.5. Implications 

In this section, we summarize what we have learned from strong lensing studies 
of the cluster mass. 
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3.5.1. Radial arcs and the very central mass profile 

As explained in §2, the radial arcs impose a strong constraint on the degree of 
central mass concentration in the cluster. Clusters with a radial arc are listed in 
Table I. A good example of a radial arc is shown in Fig. 13. Radial arcs can only 
be seen on the HST images for A370/MS0440/AC118, as they are very thin! The 
existence of the radial arcs in these systems favors a central surface mass density 
distribution flatter than oc _1 (a mass density distribution flatter than oc r~ 2 ). The 
thinness of the radial arcs would indicate a relatively cuspy mass profile; that is 
0.5 < 5 < 1, where £ oc 6~ s . The expected slope of the surface mass density is 
steeper than that of NFW. This may suggest that the radial mass density profile of 
the cluster central region is steeper than r _1 , as indicated by the higher resolution 
simulation using GRAPE, 77 ) which has a higher resolution than NFW's simulation. 
However, the central giant elliptical certainly has influence on the width of radial 
arcs. Therefore, the detailed quantification of the tangential deformation rate of the 
radial arcs are very important. Resolving the width, measuring the redshift and the 
velocity dispersion of the source galaxies of radial arcs, and measuring the velocity 
dispersion profile of the central elliptical galaxy will be key observations of the 10m 
class telescopes, such as Keck, SUBARU, VLT, Gemini and in the moderate future 
NGST. 

Table I. List of detected radial arcs. The first column contains the names of the clusters and radial 
arcs. The paper in which the radial arc was first reported is cited. Since the radial arcs are 
elongated along the radial direction by about a few arcsec, the distance to nearby cD is measured 
from the approximate arc center and is shown in the second column. The third column gives the 
redshift of the clusters. None have a spectroscopic redshift yet. The last column is the model 
prediction of the arc redshift. 



cluster (arc name) dist to cD Zd z plc d 



MS2137-23 (Al) 73) 


-5" 


0.313 


A370 (R) 105) 


~8."4 


0.37 1.3±0.2 1()5) 


MS0440+02 (A16) 74) 


-6" 


0.19 


MS0440+02 (A17) 74) 


-6" 


0.19 


AC114 (A4-A5) 33) 


~ 2."7 


0.31 1.76 ± 0.15 33) 


AC118 75) 


~5" 


0.33 


AC118 75) 


~5" 


0.33 


CL0024-1658 75) 


~ 10" 


0.39 



3.5.2. The cooling flow interpretation 

Important implications for the cooling flow model can be extracted from the 
studies of A2390, CL2236-04 and MS0302. 7+1658. These clusters have a high central 
electron density and a short radiative cooling time and therefore are expected to have 
cooling flow. Cluster mass estimates from X-ray observations using ASCA data with 
a single phase model fitting 64 ) are consistent with the strong lensing estimate. If 
the effect of the cooling flow is significant, the temperature of the hottest phase gas 
in these clusters could be a factor of 2-3 higher than the single phase model fitting 
results. However, these two clusters are marginal lenses, and the lens models are very 
sensitive to the total mass, and therefore to the cluster temperature. Increasing the 
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Fig. 13. The central part of cluster MS2137-23 seen with HST/WFPC2/F702W. 76) A radial arc is 
clearly seen. 

temperature by more than factor of 2 is incompatible with the existence of a straight 
arc in these clusters. Therefore, even if cooling flow exists in these clusters, the 
deviation of the hottest phase temperature from the ASCA measured temperature 
must be negligible. 

3.5.3. Existence of a universal mass profile 

The two models that can explain the observed arc configuration of MS1621.5 
+2640 have very important but very different cosmological implications. If the hy- 
drostatic model is correct, MS1621. 5+2640 has a very large core radius and a low 
central electron density, respectively a factor of 5 larger and 10 less than those of 
MS0302. 7+1658. MS1621. 5+2640 is known to be a non-cooling flow cluster, since 
its central cooling time is longer than the Hubble time, although the temperature 
of MS1621. 5+2640, ~ 6.5keV, is not much different from that of MS0302. 7+1658, 
~ 5keV. This indicates that there are large variations in the core radius and the 
central density of the cluster mass distribution even if the temperature of the clus- 
ters have similar values. This fact has been noticed by Fujita and Takahara, 78 ^ who 
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examined the fundamental plane of nearby clusters. They concluded that if the ratio 
of the dark matter density to the gas density at the cluster center is constant in all 
clusters, the cluster mass distribution cannot be characterized only by its virial mass 
and redshift; the formation epoch of the cluster and its evolutional history may also 
be important parameters. Since Hashimotodani et al. 67 ) are using lens modelling to 
estimate the central dark matter density, their results are free of the assumption on 
the ratio between the dark matter and gas densities. Therefore, the standard Press- 
Schechter theory 79 ) should be used with caution, as it assumes that the cluster is 
just virialized at the observed redshift when the theoretical prediction of the clus- 
ter temperature and luminosity functions are made. Modifications of the standard 
Press-Schechter theory, which take into account the difference between the formation 
and observed redshift, are thus crucial for comparing the theoretical predictions with 
the observed cluster temperature and luminosity functions. 80 -* " 84 ) Nevertheless, if 
the compact core model is correct, it indicates that the central electron density is 
not a good representative of the central dark matter density, and the possibility that 
the clusters have a universal mass density profile, characterized only by its redshift 
and virial temperature, may then be close to reality. 

It is also worth to note that Bartelmann and Steinmetz 142 ) claim that the (3- 
model does not provide a good description of the X-ray surface brightness profile of 
clusters as determined from their iV-body-hydrodynamics simulations. They argue 
that the /3-values in the fit depend critically on the background level and that this 
introduces a significant bias in the mass determination built on /3-model fits. Actu- 
ally for most of the lensing clusters, the central X-ray surface brightness is close to 
the background level of ROSAT/WRI, and the (3 values obtained, (3 ~ 0.45 — 0.55, 
are systematically somewhat lower than those for nearby clusters, (3 ~ 0.6 — 0.7. 
However, we have seen that the cluster mass distribution models constructed with 
these relatively small (3 values are consistent with the cluster mass distribution mod- 
eled by the strong lensing observations. Although this result may interpret that the 
obtained small value of (3 for the distant lensing clusters represents the real mass 
distribution well, further X-ray observations with much higher sensitivities and with 
similar or better spatial resolution than J?OSAT/HRI, e.g. AXAF, XMM, are re- 
quired to clarify the radial profile of clusters in the outskirts. 

3.5.4. Extrapolation of general relativity up to the cluster scale 

We can now draw important conclusions concerning whether cluster lensing pro- 
vides empirical support for the extrapolation of general relativity up to the cluster 
scale. If this extrapolation fails on the cluster scale, then the discrepancy between 
lensing and X-ray mass estimates should be systematic. However, as we have seen, 
excellent agreement between these results has already been reported in about half a 
dozen clusters. Clusters still exhibiting a discrepancy are likely to be understood in 
an astronomical context when better quality data are in hand to complete a detailed 
analysis. Therefore, we conclude that the extrapolation of general relativity up to 
the cluster scale is empirically supported by cluster lensing. 
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§4. The missing lens problem and very distant clusters 

Looking for and studying very distant galaxy clusters, clusters with z > 1, are 
very important because: i) the number density of rich clusters at large redshifts is a 
very sensitive measure of the mean density of the Universe; ii) high redshift clusters 
provide important information to understand the formation and evolution of galaxies 
(e.g. the evolution of the metallicity of the hot intra-cluster medium, morphological 
evolution, and star formation history in a dense environment). 

However, this is a difficult enterprise, as there are yet only a few confirmed very 
distant galaxy clusters. The first necessary step is to identify very distant clusters by 
conducting very deep observations. A random search of the sky is not practical since 
one expect to detect only a very small number of very distant clusters. A possible 
route to identify high redshift clusters is to search in a dark lenses field, as explained 
below. 

The expression 'missing lens problem' can be found in the unique textbook on 
gravitational lensing written by Schneider, Ehlers and Falco. 10 ) A number of quasar 
pairs are candidates to be lensed systems. In a high fraction of these systems, either 
no deflector or only a part of a deflector is clearly identified. The difference between 
this problem and the usual missing mass problem is that the mass-to-light ratio 
required to explain the pair of quasars by a gravitational effect is extremely large 
compared to that of galaxies. A yet unidentified part of lens objects (distinct from 
a galaxy and/or galaxy clusters), is called a 'dark lens'. The dark lens search is an 
attempt to identify dark lens objects as already known objects. 85 ) If all ultra-deep 
multi- wavelength searches fail to find any evidence of galaxies and/or clusters as 
the lenses, it could lead us to conclude the existence of a yet unknown new type of 
object which contains barely any luminous matter. Table 2 of Hattori 86 ) summarizes 
such lens candidates. A large fraction of the lens candidates exhibit the missing lens 
problem. We now discuss two of them, for which mysterious rich clusters of galaxies 
may play an important role. 

4.1. The lens system MG2016+112 

Hattori et al. 87 ) have discovered a cluster of galaxies at z = 1, named AXJ2019 
+1127, using X-ray observations in the direction of MG2016+112, a confirmed mul- 
tiple QSO at z = 3.27. 88 -* When discovered, this cluster was claimed to be a dark 
cluster because of the lack of an optical counterpart, 89 )' 9 °) although the existence 
of a massive cluster has been predicted by theoretical modelling. 91 ) Recent deep 
spectroscopic and imaging studies in optical and IR bands unveiled the nature of 
the dark cluster. Kneib et al. 92 ) spectroscopically found 6 possible member galax- 
ies in this field (Fig. 14). Using deep V, I Keck images, and wide-field K s NTT 
images, Bem'tez et al. 93 ) found a tight red sequence of galaxies which has a slope 
in good agreement with the model predictions for z ~ 1 clusters. 94 ) Based on the 
identified number of member galaxies 93 ** and the mass estimated from the X-ray 
temperature, 87 ^ the mass-to-light ratio within an aperture of O.S/i^Mpc was found 
to be M/Ly ~ HO/150, which is consistent with the mass-to-light ratio found in 
nearby clusters. 95 ^ Therefore, it is very likely that a massive cluster of galaxies ex- 
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Fig. 14. Unveiling the nature of the dark cluster AXJ2019+112. The I-band image of AXJ2019+112 
field taken with NOT and overlaid X-ray contours taken by ROSAT/HRI. 87 - ) The spectroscop- 
ically measured redshifts of galaxies are attached and possible member galaxies of the cluster 
AXJ2019+112 are marked by "the thick circle." Although a dozen member galaxies were con- 
firmed by deep Keck imaging studies and photometric studies, 93 - ) there is a debate on the 
interpretation of the ROSAT/HRI results. 
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ists at z = 1 in the line-of-sight of MG2016+112. AXJ2019+1127 is one of the most 
massive clusters known at z ~ 1. Furthermore, it is one of the rare high-redshift 
clusters for which strong lensing events are clearly detected. Thus, it provides a 
unique opportunity to study the mass distribution in a very high redshift cluster. 
Bem'tez et al. 93 ) successfully modeled (with a cluster surface mass density distri- 
bution similar to the NFW model) the multiple images of QSO and the extended 
arc-like image detected in the HST/NICMOS image. The cluster mass derived from 
the lens modelling is also consistent with X-ray results. 

It is worth noting that the Einstein ring radius of AXJ2019+112 is about 2" 89 ) 
if the cluster centre coincides with the giant elliptical located at the middle of the 
triple QSO images, and it is very small compared to clusters at intermediate red- 
shift (z ~ 0.2 - 0.5), which are as hot as AXJ2019+112. Therefore, the central 
surface mass density of AXJ2019+112 is shallower than those cluster lenses with 
similar temperatures at intermediate redshift. This may be reflecting the situation 
in which 1) there is a large variation in the central surface mass density from clus- 
ter to cluster independent of their redshift, or 2) there is a strong evolution in the 
central surface mass density. However, it should be emphasized that the interpre- 
tation of the ROSAT/RRl results of AXJ2019+112 leads to different results. 87 )' 93 ) 
Although Bemtez et al. 93 ) claimed that there are three X-ray sources in this field, 
since there are three patches in the X-ray image, it is dangerous to take these as real 
sources. For example, the early ROSAT/HRl image of A370 n ) exhibited two X-ray 
peaks coinciding with the centre of the two giant ellipticals. However, as shown in 
Fig. 9, the most up-to-date X-ray image of A370 (with an exposure time twenty 
times longer than the previous one) does show only one prominent peak. Clearly, 
photon statistical errors must be considered seriously before reaching hasty, unjus- 
tified conclusions. Therefore, determination of the accurate cluster central position, 
morphological studies by detecting more than an order of magnitude larger num- 
ber of X-ray photons, and the accurate measurement of the X-ray temperature of 
AXJ2019+1127 will be key observations to conduct with future X-ray missions. 

4.2. The double quasar Q2345+007 

The quasar Q2345+007 has been one of the most mysterious lens candidates of 
double quasars since its discovery. 96 ) In spite of deep and wide field optical searches, 
the main lens has not yet been identified. 97 ) A very faint galaxy was found at the edge 
of the secondary image, B image, after subtraction of the point spread function of the 
image. 9S ) Since a C IV doublet absorption line (z = 1.483) 10 °) was found in the B 
image, the redshift of the faint galaxy is supposed to be 1.483. However, the expected 
mass of the galaxy is too small to explain the wide separation of the two images, 
unless it has an extremely large mass-to- light ratio. A large cluster at z = 1.49 
was claimed, since both of the two images have metal absorption lines with redshift 
z = 1.491. Bonnet et al. 101 ) reported the detection of a possible lensing cluster from 
a weak lensing shear field with a strength of 7 ~ 0.15, confirmed by van Waerbeke 
et al. 102 ) and arclet candidates. After this detection, a galaxy cluster candidate was 
found as an enhancement in the number density of faint galaxies at the right location 
in the sky predicted by the shear field. 98 )' 103 ) Pello et al. 104 ) made photometric 
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redshift estimations for the galaxies in the cluster candidate field and found an excess 
of galaxies at z ~ 0.75 ± 0.1. Assuming the cluster has this redshift, the velocity- 
dispersion required to produce the observed shear pattern should be 790l{7Qkm/s, 
104 ) which corresponds to a hot gas temperature of ksT ~ 4.1^'g(/3/1.0) _1 keV. The 
typical value of (3 (~ 0.66) yields k^T ~ 6.2keV, which is as hot as CL2236-04 at 
z = 0.552. Therefore, it is expected that the cluster speculated from the lensing 
and photometric studies is a bright X-ray source. However, non detection of X-ray 
emissions from the cluster by ASCA and ROSAT/UKI observations was reported. 86 ) 
This deepened the mystery further. 

4.3. Implications 

Other than the original interest in the missing lens problem (that is that it 
indicates the existence of a yet unknown new type of dark objects in the universe), 
it is likely that the dark lens search is an efficient way to detect a very distant 
massive cluster and provides a unique opportunity to study mass distributions in 
very distant clusters. Therefore, searches for massive condensations in the field of 
dark lens objects by the detection of the lensing signatures with deep optical imaging 
and by X-ray detection is a possible direction for observational cosmology in the near 
future. 

§5. Clusters of galaxies as natural gravitational telescopes 

5.1. Looking at distant galaxies through cluster-lenses 

Cluster lenses magnify and distort galaxies behind them. For an efficient cluster 
(a massive cluster with intermediate redshift z ~ 0.2-0.4) the magnification factor 
for the faint galaxy population is typically ~ 2 in a few square arc-minutes. This 
gain would correspond to a factor of ~ 1.5 in the diameter of a telescope or an 
increase by a factor of ~ 4 in exposure time. Clearly, looking through cluster lenses 
can be of some reward when studying a faint (and thus distant) galaxy population, 
as it allows us to observe intrinsically fainter objects that would not otherwise be 
observable when looking at blank fields. Cluster lenses do magnify but also distort 
the shape of distant galaxies, and the farther the source, the stronger the distortion. 
Hence the shape of a lensed galaxy (and whether it is multiple or not) is generally 
a good distance indicator. These properties have effectively been used in detail in 
recent years to study the faint and distant galaxy population. Thus cluster lenses 
have been nicknamed "natural telescope" . 

Cluster lenses can be used as telescopes in two different ways, i) as magnifying 
glasses for non-resolved sources. Here, only the magnification property is used, to 
constrain galaxy counts to fainter limits, and to study photometric properties of 
fainter galaxies, ii) Cluster lenses can also be used as real telescopes for resolved 
sources. Here, both magnification and distortion properties are used to 1) constrain 
galaxy morphology to fainter limits, and 2) constrain their distance through their 
observed shape. 

The high quality imaging obtained with the HST/WFPC2 camera provides the 
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key observations that make possible the use of cluster lenses as natural telescopes. 
Indeed, thanks to its high-resolution imaging capabilities it allows us to better iden- 
tify morphologically multiple images and to accurately compute the shapes of arclets. 
These critical parameters are the necessary ingredients for the construction of a ro- 
bust mass distribution model of cluster lenses. Furthermore, if at least one multiple 
image has a spectroscopic redshift, then the mass distribution model is absolutely 
calibrated, making a cluster lens an effective cosmological telescope. 

5.2. Resolved lensed galaxies — Morphological and physical properties 

Morphological properties of distant 
lensed sources were first addressed by 
Smail et al., 105 ) who estimated the in- 
trinsic linear sizes of the galaxies and 
showed them to be compatible with a 
significant size evolution with redshift. 
Smail et al. 105 ) studied the HST im- 
ages of four distant massive clusters and 
found that the half light radius of the 
arcs along the radial direction of the 
cluster potential is a factor of 1.5-2 
times smaller than the equivalent pop- 
ulation in the local field. If the gross 
properties of the central regions of the 
cluster surface mass density at the posi- 
tion of the tangential arcs can be charac- 
terized by U ~ the radial stretching 
rate is unity (see §2.1), and the above 
mentioned results can be taken as the 
intrinsic nature of the arcs. Since they 
showed that the U ~ O^ 1 is a good rep- 
resentative of the central mass distribu- 
tion of their sample clusters, they sug- 
gested that the dominant population of 
star-forming galaxies at z ~ 1 is a factor 
of 1.5-2 times smaller in size than the equivalent population in the local field. If 
the cluster surface mass density profile obtained at the radial arc position can be 
extrapolated to the tangential arc position, the surface mass density is shallower 
than E oc and the radial width of the tangential arc is larger than the intrinsic 
source size. This strengthens the conclusion reached by Smail et al. 105 ) Recently, 
a number of high-z arcs and arclets have been confirmed spectroscopically. 14 )' 106 ^ 
- 108) a detailed analysis of their intrinsic size and morphology is then possible using 
lens modelling. More recently, several detailed analyses of the morphology of high-z 
arcs were reported using lens modelling. For example, Soucail et al. 112 ) found a 
ring galaxy LRG J0239-0134 at z = 1 through the cluster lens A370 (Fig. 15). The 
morphological nature of this object is more complex than that of nearby normal 



Fig. 15. The image of the distant ring galaxy, 
LRG J0239-0134 at z = 1.062 ±0.001 mag- 
nified by cluster lens A370. 49) ISO/LW3 
contours overplotted on HST/WFPC2 
F675W image (with a 12" size). This ob- 
ject is appeared closed to the galaxy #32 
but this galaxy has been subtracted for 
clarity (a black dot position at the right 
of the ring galaxy) . 
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galaxies and is very similar to the Cartwheel galaxy, 113 ) which is thought to be an 
interacting galaxy. Most of them appear to be knotty with a more complex mor- 
phology than their local counterparts, 109 - ) " m ) although there may be some biases 
due to observations in the UV rest frame. 

5.3. Resolved lensed galaxies — Distances of faint populations 

The probability distribution of the redshift of an arclet for a given mass distribu- 
tion of the cluster and the shape of the arclet depends only on the intrinsic ellipticity 
distribution of faint galaxies: 32 ) the larger the redshift, the larger (in general) the 
deformation induced by the cluster. From a secure mass distribution defined by a 
few sets of multiple-images, we can then estimate a likely value of z of arclets behind 
well-constrained clusters. This is most interesting, as this method is purely geomet- 
rical and therefore does not suffer from the spectroscopic bias that provides redshifts 
only for these faint galaxies with (strong) optical emission lines. To evaluate the 
accuracy of this technique on a cluster-lens A22 1 8, 32 -* Ebbels et al. 114 ) have suc- 
cessfully measured with the 4.2m WHT the redshift of 19 arclets, providing a first 
confirmation of the lensing method. Their results shows that the mean redshift of 
the faint field population at R ~ 25.5 (B ~ 26-27) is low, (z) = 0.8-1. Similar work 
is now in progress for other cluster lenses, such as A2390 115 ) and AC114. 

5.4. Non-resolved lensed galaxies in the Submm and MIR bands 

A fruitful avenue that has developed in the last two years is the study of distant 
galaxies in until recently unexplored windows of the electromagnetic spectrum: the 
Mid-IR, the Sub-millimetre and Millimetre bands. Indeed, the ISOCAM camera on 
the ISO satellite and the SCUBA instrument on JCMT, thanks to their larger survey 
efficiency (compared to previous instruments), allow for studies of distant galaxies. 
Boosting their detecting efficiency with lensing magnification has shed new light in 
the nature of faint galaxies detected in these two wavebands (Fig. 16). 

Distant, z > 3, objects were supposed to be strong sub-mm sources because 
we do expect to detect redshifted dust-emission from these distant galaxies. 122 ** A 
first tentative application in sub-mm has been recently presented by Smail, Ivison 
and Blain. 121 ) It is now extended to cover seven clusters. 123 ) They found in total 16 
3d sources and 10 4<r sources at 850^m through all seven clusters. Their sub-mm 
galaxy counts greatly exceed those expected in a non-evolving model based on the 
local IRAS 60/im luminosity function 124 ) or in a model that can explain the optically 
selected Lyman-dropout 125 ) and Lyman-emission samples 126 ) (Fig. 16). Smail et 
al. 127 ) have attempted to find optical counterparts of these sub-mm sources using 
HST images. Counterparts were identified for 14 of the 16 3a sources. One of them 
is thought to be the distant ring galaxy LRG J0239-0134 shown in Fig. 15. The 
morphologies of the optical counterparts fall into disturbed/interacting objects and 
compact objects. The disturbed and interacting galaxies constitute the largest class, 
which suggests that interactions play an important role in triggering star formation 
and nuclear activity in the distant universe. 

A deep survey through cluster lenses in the mid-IR wave band (7/xm and 15/iin) 
using JSO/ISOCAM also has been performed. 128 ) Currently, a total area of about 
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Fig. 16. Cumulative sub-mm number counts through cluster lenses and in the field. 116 - ) The results 
obtained through lenses are indicated by solid squares. The results obtained by earlier works 
from Barger et al. (B98), 117) Ealse et al. (E98), 118) Holland et al. (H98), 119) Hughes et al. 
(Hu98), 120 ^ and Smail et al. (SIB) 121 - ) are shown by solid circles. The number counts through 
cluster lenses reaches down to a very faint flux density limit of 0.5mJy at 850/im, several times 
fainter than the confusion limit of the JCMT in blank fields. 120 - ) 



53 arcmin 2 has been covered in maps of three clusters. The source counts, corrected 
for cluster contamination and lensing distortion effects, exhibit an excess by a fac- 
tor of 10 with respect to the prediction of a non-evolution model. 129 ) The results 
suggest that abundant star formation occurs in very dusty environments at z ~ 1. 
As shown in Fig. 15, the distant ring galaxy LRG J0239-0134 is a bright mid-IR 
source. 112 ) The fact that the [Oil] line is more extended than the underlying stellar 
continuum indicates on going star formation in the ring. The detection of the [NeV] 
emission line indicates the existence of an active nucleus. These results indicate 
that a galaxy-galaxy interaction triggered the formation of the star-forming ring and 
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nuclear activity in LRG J0239-0134. 

These results indicate that interactions play an important role in triggering star 
formation and nuclear activity in the distant universe, and that care must be taken 
to take account of the dust in inferring global star formation history from UV and 
optical luminosities of high-z galaxies. 

§6. Arc statistics 

As in the case of the statistical study of the multiply imag ed QSOs, 13 °) it is 
natural to expect that the statistical study of arcs could be a powerful tool to study 
131) - 134) ^ cosm ological world models (and particularly to the cosmological 
density parameter Qq and the cosmological constant Aq), ii) the average cluster 
mass distribution/profile, and iii) the nature and evolution of high redshift galaxies. 
Aiming to extract fruitful information regarding these topics, statistical studies of 
arcs have been performed by many authors. These studies have been referred to 
as "arc statistics". Clearly, arc statistics depend on a large number of parameters 
which still contain large uncertainties, and we therefore here attempt to clarify its 
possible application in order to sharpen the focus for its future use. 

Wu and Hammer 135 ' provided the definition for a giant luminous arc (GLA) 
and performed the first quantitative comparison between the theoretically predicted 
number of GLAs and the observed number of GLAs. Because no observational 
results on systematic arc surveys had been reported, the frequency of finding GLAs 
in clusters was extrapolated from the reported number of GLAs at that time. They 
found that the frequency of finding GLAs in clusters is very sensitive to the degree 
of the central mass concentration in the clusters. They concluded that the observed 
high frequency of finding GLAs indicates a high central mass concentration in the 
clusters. The first well-defined GLA survey was conducted by Le Fevre et al. 15 ' 
They found 6 GLAs among 15 X-ray luminous clusters. (Although their original 
sample contained 16 clusters, one of their samples turned out not to be a cluster. 
137 )) They selected X-ray luminous clusters with z > 0.2 from the EMSS. The 
first quantitative comparison with these survey results was performed by Hattori, 
Watanabe and Yamashita. 136 ' They found that the theoretically expected number of 
GLAs within the framework of the circularly symmetric cluster potentials, which is 
consistent with X-ray observations, is two orders of magnitude less than the observed 
number. Molikawa et al. 137 ' found that the radial profile of Le Fevre et al. sample 
clusters have a large variation from cluster to cluster, and they are far from the radial 
profile adopted by Hattori, Watanabe and Yamashita. 136 ) However, Molikawa et al. 
137 ) reached the same conclusion as Hattori, Watanabe and Yamashita. 136 ) Hamana 
and Futamase 138 ) examined the GLA statistics by taking into account the evolution 
of the luminosity function found by the Canada-France Redshift Survey. 139 ' They 
showed that the observed evolution in the galaxy luminosity function causes the 
GLA number to increase by at most a factor of 2-3. These results suggest that more 
realistic modelling of the lens cluster potential, taking into account irregularities 
due to substructures and bright member galaxies, is important to account for the 
discrepancy. The importance of the irregularities is also claimed in the series of works 
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by Bartelmann and his collaborators 14 °) " 143 ' using numerical simulations. They 141 -* 
found that the irregular distribution of dark matter in clusters can increase the cross 
section of forming giant arcs by two orders of magnitude compared with the spherical 
symmetric mass distribution model. 

As for the next step of the arc statistics, if one considers the explosive progress 
in the refinement of the determination of the mass distribution in individual clusters 
by strong and/or weak lensing observations, constraining the mass distribution of 
clusters using GLA statistics may not be an efficient way. The arc statistics should be 
studied with the well-constrained mass distribution model of the individual cluster 
by the detailed lensing studies best with HST/WFPC2 data, with the help of X-ray 
data. This process must be done for at least the 15 clusters in Le Fevre et al. sample, 
because this sample is an unbiased sample and the best for a statistical treatment. 
Then the questions to be addressed using the arc statistics are those concerning 
the evolutionary nature of the high redshift galaxies and maybe these involved in 
constraining the cosmological world model. This kind of work has already been 
initiated by Bezecourt. 144 -* Bezecourt studied the statistics of an arclet in A370 
because the cluster mass distribution of A370 is well-constrained by lensing. It was 
shown that using a realistic cluster potential is essentially important to recover the 
observed number of arclets. 

Several meaningful results for probing the high redshift universe using arc statis- 
tics have been reported. Miralda-Escude 133 ' showed that the number of GLAs re- 
sponds sensitively to the intrinsic size of galaxies, and it decreases drastically when 
the intrinsic size of galaxies becomes smaller than the seeing FWHM. Hattori, Watan- 
abe and Yamashita 136 - ) showed that the apparent-unlensed source size distribution 
of GLAs has a sharp peak at the seeing FWHM. The seeing FWHM of the current 
arc survey of 0."8 15 ) is about the size of the L* spiral galaxies at z ~ 1 if there is no 
evolution in the source size, and the drop of the number in the larger source size is 
due to the exponential decrease of the galaxy luminosity function over L*. There- 
fore, it is expected that the source galaxies of GLAs are mostly the spiral galaxies 
with z < 1. Wu and Hammer 135 ' and Hattori, Watanabe and Yamashita 136 ) con- 
firmed this. Hattori, Watanabe and Yamashita 136 ) pointed out that the fraction of 
GLAs which are spheroidal galaxies nearly in their forming epoch in the observed 
GLAs is relatively large if the SED of the UV bright elliptical is a good representa- 
tive of the majority of elliptical galaxies. It is worth noting that a fraction of the 
GLAs and multiply imaged galaxies are made from the high redshift galaxies with 
z > 2. 145 )' 146 )' m ) This may suggest either a lacking in our current understanding of 
the evolutional features of the high redshift galaxies or that high redshift elliptical 
galaxies in the formation process started to be detected as GLAs. Quantification of 
the star formation rate in the GLAs by multi-band observations, especially in MIR 
and sub-mm wave bands mentioned in §5, may answer to this question soon. 

A more extended arc survey for an unbiased sample containing a much larger 
number of clusters is of course important. 147 ' The cluster sample provided by the 
ROSAT all-sky survey is undoubtedly a good unbiased sample for the arc survey. 
It is also expected that a new, better cluster sample will be provided by an all-sky 
survey in the 2-10keV band, which is more appropriate for detecting clusters than 
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the ROSAT band, planned to be performed by the X-ray satellite ABRIXAS 148 ) 
soon. 

Topics in connection with cosmological constraints are presented in the next 
section. 

§7. Cosmological constraint by cluster lensing 

7.1. Constraints from arc statistics 

Bartelmann et al. 143 ** performed a set of cosmological iV-body simulations, and 
found that the optical depths for the formation of giant arcs is sensible for the 
cosmological world models and is the largest for the low Qq cold dark matter model 
(OCDM). They concluded that the OCDM is likely to be the only model which can 
match current observations of the giant arc survey. These differences originate from 
the different epochs of cluster formation among the cosmological models. They found 
that clusters with z ~ 0.2-0.6, which are the most responsible for the formation of 
arcs, in the OCDM exhibit the most irregular shape in mass distribution, and as a 
result they have the strongest shear, since the cluster formation epoch is just around 
this redshift range in this model. This is very interesting and a practical variation 
of the idea that uses the degree of inhomogeneity in cluster mass distributions to 
constrain Qq first proposed by Richstone, Loeb and Turner. 149 ) One of the weakness 
of this kind of method is that the observable, that is the degree of inhomogeneity, to 
be compared with the model prediction is somewhat abstract. However, Bartelmann 
et al. 143 ) made quantitative comparison possible by connecting with arc statistics. 
However, the contribution of the bright member galaxies are not taken into account 
because of their numerical resolution limit, although they clearly have central role in 
cluster strong lensing, as shown in §3. Further improvement of numerical simulations 
is required. 

The redshift distribution of the GLAs have been proposed to constrain the cos- 
mological constant. 15 °) However, the redshift distribution of the GLAs is very sen- 
sitive to the mass distribution model of the cluster. It requires a very accurate 
modelling of the cluster mass distribution to constrain the cosmological constant. 
Further, the expected number of GLAs in one cluster is less than 1. We must per- 
form an arc survey for a large number of clusters for which the mass distribution is 
precisely constrained. This will be a difficult task, but it is required to be done. 

7.2. Cosmological constraints using {Qq,Aq) dependence of Dd s /D s and to(z) 

The combination of the angular diameter distance appears in the lens equation 
as D^s/Ds, an d its value depends only on the cosmological constant. 151 ) Using this 
fact, several methods to constrain the cosmological constant have been proposed. 
The merit of this kind of method is that one cluster is enough to provide a stringent 
constraint on the cosmological constant. However, the demerit is that the mass 
distribution of the cluster must be measured very precisely. One variation of this 
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method is using the lensing factor u(z) defined as 

(7-1) 

The convexity of the lensing factor has a (J?o, A)) dependence. Gautret, Fort and 
Mellier 161 ) proposed the triplet method, in which (Qq,Aq) is constrained by using 
this dependence. The advantage of the triplet method is that it does not require 
the accurate measurement of the cluster mass distribution. Although the Freedman 
distance is used as the angular diameter distance in these methods, it is likely that 
there is a variation of the angular diameter distance from line-of-sight to line-of- 
sight due to the inhomogeneity of the universe. 152 )' 153 ^ Therefore, the quantitative 
calibration of the variation of the angular diameter distance is an important comple- 
mentary works to extract reliable results from these methods. In this section, three 
methods proposed to constrain (l?o, A)) following the above mentioned principle are 
summarized. 

7.2.1. Constraints from depletion curve 

Broadhurst 154 ) first found that the galaxy number count in the direction of 
the cluster exhibits diagnostics which are now called the "depletion curve". Since 
the detail nature of the depletion curve depends on the magnification due to the 
cluster lens, he proposed to use it for measuring the cluster mass. Fort, Mellier and 
Dantel-Fort 155 ^ pointed out that the depletion curve can be used to constrain the 
cosmological constant if the mass distribution of the cluster is accurately measured. 
The expected galaxy number density in the direction of the lensing cluster is 

N(r) = N 0f i(r) 2 - 5a -\ (7-2) 

where \x is the magnification at the position r and a = d log(TV) /dm is the slope 
of the galaxy number counts. The first term of the exponent, 2.5a, describes the 
increase of the galaxy number density, since fainter galaxies become brighter than the 
limiting magnitude due to the lensing magnification. The last term of the exponent, 
— 1, describes the decrease of the galaxy number density, since the unit solid angle 
area of the unlensed source plane becomes a factor of fi larger due to lensing, and 
the galaxy number within the unit solid angle of the lensed source plane decreases a 
factor of 1/ fi. Since the slope of the galaxy number counts is ~ 0.17 ± 0.02, 155 ) " 157 ) 
less than 0.4, the galaxy number density through a cluster decreases. Therefore the 
radial profile of the galaxy number counts through a cluster is called the "depletion 
curve" . The main idea to constrain the cosmological constant by using the depletion 
curve, is as follows. At the Einstein ring radius the magnification is infinite and the 
depletion curve reaches zero as its minimum value. The Einstein ring radius becomes 
larger for a higher redshift source and converges into a finite radius for sources at 
the infinite redshift. The Einstein ring radius for sources with infinite redshift is 
referred to the "last critical line". The observed deletion curve is the superposition 
of these depletion curves of different source redshifts. The last critical line can be 
identified as the dip in the depletion curve appearing at the largest radius. Since 
the radius of the last critical line is larger for larger cosmological constants for a 



46 



M. Hattori, J. -P. Kneib and N. Makino 



fixed cluster mass distribution model, the cosmological constant can be constrained 
by identifying the last critical line for the cluster for which the mass distribution 
is well constrained. Fort, Mellier and Dantel-Fort 155 ^ applied it to CL0024+17 and 
constrained the cosmological constant to be larger than 0.6. The merit of this method 
is that the cosmological constant can be constrained with only one well-studied 
cluster. However, the cluster mass distribution model used by them is likely to be 
far from reality since there is evidence of the superposition of two large clusters in 
the line-of-sight, 49 ** and the detection of the last critical line for this cluster is also 
not so clear. A revised lens modelling and observational studies with deeper imaging 
are necessary to extract a clear conclusion on the value of the cosmological constant. 

7.2.2. GRAMORs 

Futamase, Hattori and Hamana 158 ) predicted the existence of a highly magni- 
fied gravitationally lensed yet morphologically regular images (they referred to as 
GRAMORs) in clusters of galaxies and proposed a new method to constrain the 
cosmological constant by using the GRAMORs. 159 )' 146 ) A marginally critical lens 
cluster may have GRAMORs if the source galaxy is located close to the line of sight 
through the cluster center. The lowest source redshift for which the cluster can have 
GRAMORs is lower for a larger cosmological constant for a fixed cluster mass dis- 
tribution model. Clusters with intermediate X-ray luminosities are favored to give 
a lower limit on the cosmological constant, since the difference of the possible low- 
est source redshift of the GRAMORs with and without the cosmological constant 
is largest for those clusters. If one identifies a very low redshift GRAMOR, it is 
sufficient to give a stringent lower limit on the cosmological constant. The merit 
compared with the depletion curve method, where we have to rely on the photomet- 
ric method to measure the source redshift, is that the redshift of the GRAMORs is 
spectroscopically measurable. 

Although these two methods require only one cluster to constrain the cosmolog- 
ical constant in principle, a precise measurement of the cluster mass distribution is 
required. It is practically very hard. 

7.2.3. The triplet method 

Attempts to disentangle the coupling between the cosmological parameters, the 
source redshift and the lens modelling have been made by Lombardi and Bertin 16 °) 
and Gautret, Fort and Mellier. 161 ) In particular, the triplet method proposed by 
Gautret, Fort and Mellier 161 ) does not require to know the detail of the cluster mass 
distribution and is summarized in this section. The ellipticity parameter, e, for a 
circular source at z% is obtained by 

(7-3) 



1 WjKoo 



where the subscripts i refers to the redshift z,, 7^ and the shear and the 

convergency for the source at infinite redshift, and the ellipticity is defined as 
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Here r is the axis ratio of the image isophotes. Take three galaxies with different 
redshifts at Zi, Zj and z k , which are located close together in the sky through the 
lensing cluster and name these "triplet" . Because of the proximity of the triplet in 
the sky, the local convergency Kqo and shear can be assumed equal all three. The 
triplet of galaxies gives three sets of Eq. (7 # 3). The quantities Kqq and 7oo can be 
deleted from these equations, and a final equation independent of both on Koo and 
7oo can be derived, such as 



1 LOi uJigjgk 
1 ujj ujjg k gi 
1 u k L0 k 9i9j 



0. (7-5) 



The final equation is satisfied only when we assume correct values of the real uni- 
verse for (J?o,A)). Then the problem is reduced to determining for the (Qq,Aq) 
values which satisfy the final equation. Unfortunately, the method is degenerate 
in ([2q,Aq). Therefore, a complementary cosmological constraint, such as a high- 
redshift supernovae, 162 ) is required to break the degeneracy. It is also pointed out 
that the triplet method is able to measure directly the curvature of the universe, 
1 — Qq — Aq, because of the mean orientation of its degeneracy. Although in the 
above discussion the source intrinsic ellipticity was omitted, errors introduced by 
the intrinsic ellipticity must be taken into account. Since the orientation and the 
ellipticity of the source intrinsic shape are likely to be random, it will be canceled out 
if we average over many sources. Errors in the source redshift due to the limitation 
of the photometric redshift method and systematics due to the possible multiple lens 
plane, e.g., the lensing of the galaxies at Zj and z k by a galaxy at Zj, where Zi is the 
lowest within the triplet, also introduces errors in the obtained cosmological param- 
eters. Therefore, applying the triplet method to many clusters is desired. Gautret, 
Fort and Mellier 161 ) have undertaken an intensive study on possible error estimates 
and have concluded that the observation of 100 clusters can separate the i?o = 0.3 
and J?o = 1 universe with a 2a confidence level and could be a good subject for VLT 
and NGST. 



§8. Conclusion 



We have seen how fruitful the cluster lensing studies are. Surprisingly, almost 
all areas described in this review are realizations of Zwicky's dream. It is no doubt 
that cluster lensing will play central role as one of the standard techniques to study 
the evolutionary history of the mass distribution of the universe, to probe the deep 
universe, and to constrain the cosmological world models in the next decade with 
the combination of revolutionary new observational instruments from radio to X-ray, 
which already have emerged and will emerge during the next decade. 
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